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HYBRBD EXPRESSION OF NEISSERIAL PROTEINS 

All documents cited herein are incorporated by reference in their entirety. 

TECHNICAL FIELD 

This invention is in the field of protein expression. In particular, it relates to the heterologous 
5 expression of proteins from Neisseria (e.g. N. gonorrhoeae or, preferably, N. meningitidis). 

BACKGROUND ART 

International patent applications W099/24578, W099/36544, WO99/57280 and 
WOOO/22430 disclose proteins from Neisseria meningitidis and Neisseria gonorrhoeae. 
These proteins are typically described as being expressed in Kcoli (i.e. heterologous 
10 expression) as either N-terminal GST-fusions or C-terminal His-tag fusions, although other 
expression systems, including expression in native Neisseria, are also disclosed. 

It is an object of the present invention to provide alternative and improved approaches for 
the heterologous expression of these proteins. These approaches will typically affect the 
level of expression, the ease of purification, the cellular localisation of expression, and/or the 
15 immunological properties of the expressed protein. 

DISCLOSURE OF THE INVENTION 

In accordance with the invention, two or more (e.g. 3, 4, 5, 6 or more) proteins of the 
invention are expressed as a single hybrid protein. It is preferred that no non-Neisserial 
fusion partner (e.g. GST or poly-His) is used. 

20 This offers two advantages. Firstly, a protein that may be unstable or poorly expressed on its 
own can be assisted by adding a suitable hybrid partner that overcomes the problem. 
Secondly, commercial manufacture is simplified - only one expression and purification need 
be employed in order to produce two separately-useful proteins. 

Thus the invention provides a method for the simultaneous heterologous expression of two 
25 or more proteins of the invention, in which said two or more proteins of the invention are 
fused (i.e. they are translated as a single polypeptide chain). 

The method will typically involve the steps of: obtaining a first nucleic acid encoding a first 
protein of the invention; obtaining a second nucleic acid encoding a second protein of the 
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invention; ligating the first and second nucleic acids. The resulting nucleic acid may be 
inserted into an expression vector, or may already be part of an expression vector. 

Where just two proteins are joined, the hybrid protein can be represented simply by the 
formula NH 2 -A-B-COOH. A and B can each be selected from any Neisserial proteins, and 
in particular those represented by SEQ#s M326. The method is well suited to the expression 
of proteins orfl, orf4, orf25, orf40, Orf46/46.1, orf83, 233, 287, 292L, 564, 687, 741, 907, 
919, 953, 961 and 983. 

The 42 hybrids indicated by 'X' in the following table of form NH 2 -A-B-COOH are 
preferred: 



■IA ; : B-+ 


Orf46.1 


287 


741 


919 


953 


961 


983 


ORF46:l 
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X 
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X 
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X 
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X 


X 
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X 
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X 
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X 
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X 





10 



* r — «jr W xi«o ait. uius vwro.i, i, /<*L, my, y^j, ybl and 

983. These may be used in their essentially full-length form, or poly-glycine deletions (AG) 
forms may be used (e.g. AG-287, AGTbp2, AG741, AG983 etc.), or truncated forms may be 
used (e.g. Al-287, A2-287 etc.), or domain-deleted versions may be used (e.g. 287B, 287C, 
287BC, ORF46M33, ORF46 43 3-608, ORF46, 961c etc.) and so on. 

Particularly preferred are: (a) a hybrid protein comprising 919 and 287; (b) a hybrid protein 
comprising 953 and 287; (c) a hybrid protein comprising 287 and ORF46.1; (d) a hybrid 
protein comprising ORF1 and ORF46.1; (e) a hybrid protein comprising 919 and ORF46.1; 
(f) a hybrid protein comprising ORF46.1 and 919; (g) a hybrid protein comprising ORF46.1, 
287 and 919; (h) a hybrid protein comprising 919 and 519; and (i) a hybrid protein 
comprising ORF97 and 225. 



Further embodiments are shown in the drawings and include AG287-919, AG287-953, 
AG287-961, AG983-ORF46.1, AG983-741, AG983-961, AG983-961C, AG741-961, 
AG741-961C, AG741-983, AG741-ORF46.1, ORF46.1-741, ORF46.1-961, ORF46.1-961C, 
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961-ORF46.1, 961-741, 961-983, 961C-ORF46.1, 961C-741, 961C-983, 961CL-ORF46.1, 
961CL-741, and 961CL-983. 

Where 287 is used, it is preferably at the C-terminal end of a hybrid; if it is to be used at the 
N-terminus, if is preferred to use a AG form of 287 is used (e.g. as the N-tenninus of a 
hybrid with ORF46.1, 919, 953 or 961). 

Where 287 is used, this is preferably from strain 2996 or from strain 394/98. 

Where 961 is used, this is preferably at the N-terminus. Domain forms of 961 may be used. 

Alignments of polymorphic forms of ORF46, 287, 919 and 953 are disclosed in 
WOOO/66741 . Any of these polymorphs can be used according to the present invention. 

Preferably, the constituent proteins (A and B) in a hybrid protein according to the invention 
will be from the same strain. 

The fused proteins in the hybrid may be joined directly, or may be joined via a linker peptide 
e.g. via a poly-glycine linker (Le. G„ where n = 3, 4, 5, 6, 7, 8, 9, 10 or more) or via a short 
peptide sequence which facilitates cloning. It is evidently preferred not to join a AG protein 
to the C-terminus of a poly-glycine linker. 

The fused proteins may lack native leader peptides or may include the leader peptide 
sequence of the N-terminal fusion partner. 

Host 

It is preferred to utilise a heterologous host. The heterologous host may be prokaryotic or 
eukaryotic. It is preferably Exoli, but other suitable hosts include Bacillus subtilis, Vibrio 
cholerae, Salmonella typhi, Salmonenna typhimurium, Neisseria meningitidis, Neisseria 
gonorrhoeae, Neisseria lactamica, Neisseria cinerea, Mycobateria (e.g. M.tuberculosis), 
yeast etc. 

Vectors, hosts etc. 

As well as the methods described above, the invention provides (a) nucleic acid and vectors 
useful in these methods (b) host cells containing said vectors (c) proteins expressed or 
expressable by the methods (d) compositions comprising these proteins, which may be 
suitable as vaccines, for instance, or as diagnostic reagents, or as immunogenic compositions 
(e) these compositions for use as medicaments (e.g. as vaccines) or as diagnostic reagents (f) 
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the use of these compositions in the manufacture of (1) a medicament for treating or 
preventing infection due to Neisserial bacteria (2) a diagnostic reagent for detecting the 
presence of Neisserial bacteria or of antibodies raised against Neisserial bacteria, and/or (3) a 
reagent which can raise antibodies against Neisserial bacteria and (g) a method of treating a 
5 patient, comprising administering to the patient a therapeutically effective amount of these 
compositions. 

Sequences 

The invention also provides a protein or a nucleic acid having any of the sequences set out in 
the following examples. It also provides proteins and nucleic acid having sequence identity 
10 to these. As described above, the degree of 'sequence identity' is preferably greater than 
50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). 



Nomenclature herein 

The 2166 protein sequences disclosed in W099/24578, W099/36544 and WO99/57280 are 
referred to herein by the following SEQ# numbers: 



Application 


Protein sequences 


SEQ# herein 


W099/24578 


Even SEQ IDs 2-892 


SEQ#s 1-446 


W099/36544 


Even SEQ IDs 2-90 


SEQ#s 447-491 




Even SEQ IDs 2-3020 


SEQ#s 492-2001 


WO99/57280 


Even SEQ IDs 3040-3114 


SEQ#s 2002-2039 




SEQ IDs 3115-3241 


SEQ#s 2040-2166 



15 In addition to this SEQ# numbering, the naming conventions used in W099/24578, 
W099/36544 and WO99/57280 are also used (e.g. 'ORF4\ 'ORF40', 'ORF40-1' etc. as 
used in W099/24578 and W099/36544; 'm919\ 'g919' and 'a919' etc. as used in 
WO99/57280). 

The 2160 proteins NMB0001 to NMB2160 from Tettelin et al. [Science (2000) 287:1809- 
20 1815] are referred to herein as SEQ#s 2167-4326 [see also WO00/66791]. 

The term 'protein of the invention' as used herein refers to a protein comprising: 

(a) one of sequences SEQ#s 1-4326; or 

(b) a sequence having sequence identity to one of SEQ#s 1-4326; or 

(c) a fragment of one of SEQ#s 1-4326. 
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The degree of 'sequence identity' referred to in (b) is preferably greater than 50% (eg. 60%, 
70%, 80%, 90%, 95%, 99% or more). This includes mutants and allelic variants [e.g. see 
WO00/66741]. Identity is preferably determined by the Smith-Waterman homology search 
algorithm as implemented in the MPSRCH program (Oxford Molecular), using an affine gap 
search with parameters gap open penalty-12 and gap extension penalty =1. Typically, 50% 
identity or more between two proteins is considered to be an indication of functional 
equivalence. 

The 'fragment' referred to in (c) should comprise at least n consecutive amino acids from 
one of SEQ#s 1-4326 and, depending on the particular sequence, n is 7 or more (eg. 8, 10, 
12, 14, 16, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100 or more). Preferably the fragment 
comprises an epitope from one of SEQ#s 1-4326. Preferred fragments are those disclosed in 
WO00/71574 and WO01/04316. 

Preferred proteins of the invention are found in N. meningitidis serogroup B. 

Preferred proteins for use according to the invention are those of serogroup B N. meningitidis 
strain 2996 or strain 394/98 (a New Zealand strain). Unless otherwise stated, proteins 
mentioned herein are from N. meningitidis strain 2996. It will be appreciated, however, that 
the invention is not in general limited by strain. References to a particular protein (e.g. '287', 
'919' etc.) may be taken to include that protein from any strain. 

It will be appreciated that references to "nucleic acid" includes DNA and RNA, and also 
their analogues, such as those containing modified backbones, and also peptide nucleic acids 
(PNA) etc. 

BRIEF DESCRIPTION OF DRAWINGS 

Figures 1 to 26 show hybrid proteins according to the invention. 

MODES FOR CARRYING OUT THE INVENTION 
Example 1 - hybrids ofORF46 

The complete ORF46 protein from N. meningitidis (serogroup B, strain 2996) has the 
following sequence: 

1 LGISRKISLI LSIIAVCLPM HAHASDLAND SFIRQVLDRQ HFEPDGKYHL 
51 FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY IVRFSDHGHE 
101 VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD GYDGPQGGGY 
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151 PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH NAGSMLTQGV 

201 GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE IVGAGDAVQG 

251 ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA AIRDWAVQNP 

301 NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK RSQMGAIALP 

5 351 KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI TSSTVPPSNG 

401 KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTKLDIQEL SGGGIPKAKP 

451 VSDAKPRWEV DRKLNKLTTR EQVEKNVQEI RNGNKKSNFS QHAQLEREIN 

501 KLKSADEINF ADGMGKFTDS MNDKAFSRLV KSVKENGFTN PWEYVEING 

551 KAYIVRGNNR VFAAEYLGRI HELKFKKVDF PVPNTSWKNP TDVLNESGNV 

10 601 KRPRYRSK* 

The leader peptide is underlined. 

The sequences of ORF46 from other strains can be found in WO00/66741 . 

ORF46 has been fused at its C-terminus and N-terminus with 287, 919, and ORF1. The 
15 hybrid proteins were generally insoluble, but gave some good ELISA and bactericidal results 
(against the homologous 2996 strain): 1 



Protein 


ELISA 


Bactericidal Ab 


Orfl-Orf46.1-His 


850 


256 


919-Orf46.1-His 


12900 


512 


919-287-Orf46-His 


n.d. 


D.d. 


Orf46.1-287His 


150 


8192 


Orf46.1-919ffis 


2800 


2048 


Orf46.1-287-919ffis 


3200 


16384 



For comparison, 'triple* hybrids of ORF46.1, 287 (either as a GST fusion, or in AG287 
form) and 919 were constructed and tested against various strains (including the homologous 
2996 strain) versus a simple mixture of the three antigens. FCA was used as adjuvant: 





2996 


BZ232 


MC58 


NGH38 


F6124 


BZ133 


Mixture 


8192 


256 


512 


1024 


>2048 


>2048 


ORF46.1-287-919his 


16384 


256 


4096 


8192 


8192 


8192 


AG287-919-ORF46.1his 


8192 


64 


4096 


8192 


8192 


16384 


AG287-ORF46.1-919his 


4096 


128 


256 


8192 


512 


1024 



20 Again, the hybrids show equivalent or superior immunological activity. 

Hybrids of two proteins (strain 2996) were compared to the individual proteins against 
various heterologous strains: 





1000 


MC58 


F6124 (MenA) 


ORF46.1-His 


<4 


4096 


<4 


ORFl-His 


8 


256 


128 


ORF1— ORF46.1-His 


1024 


512 


1024 



Again, the hybrid shows equivalent or superior immunological activity. 
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Example 2 - hybrids ofAG287 

The deletion of the (Gly) 6 sequence in 287 was found to have a dramatic effect on protein 
expression. The protein lacking the N-terminal amino acids up to GGGGGG is called 
'AG287'. In strain MC58, its basic sequence (leader peptide underlined) is: 

SPDVKS ADTLSKPAAP WSEKETEAK EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT 
GNGGAVTADN PKNBDEVAQN DMPQNAAGTD SSTPNHTPDP NMLAGNMBNQ ATDAGESSQP 
ANQPDMANAA DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 
NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS DADKISNYKK 
DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARPRRS ARSRRSLPAE MPLIPVNOAD 
TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY 
NGEVLHFHTE NGRPYPTRGR FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG 
TWTENGSGDV SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

15 AG287, with or without His-tag ('AG287-His' and «AG287K\ respectively), are expressed at 
very good levels in comparison with the '287-His' or '287 onta ge ed '. 

On the basis of gene variability data, variants of AG287-His were expressed in Exoli from a 
number of MenB strains, in particular from strains 2996, MC58, 1000, and BZ232. The 
results were also good - each of these gave high ELISA litres and also serum bactericidal 
litres of >8192. AG287K, expressed from pET-24b, gave excellent titres in ELISA and the 
serum bactericidal assay. 



10 



20 



Deletion of poly-Gly sequences is also applicable to Tbp2 (NMB0460), 741 (NMB 1870) 
and 983 (NMB1969). When cloned in pET vector and expressed in Ecoli without the 
sequence coding for their leader peptides and without poly-Gly (Le. as "AG forms"), the 
25 same effect was seen - expression was good in the clones carrying the deletion of the 
poly-glycine stretch, and poor or absent if the glycines were present in the expressed protein. 

AG287 was fused directly in-frame upstream of 919, 953, 961 (sequences shown below) and 
ORF46.1: 



30 



35 



40 



AG287-919 
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GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGAT 

ATCCAAACCTITCCGCAACCCGACACATCCGTCATCAACGGCCCGGACC 

ACX^CGGTCGGCGGCGGCGGGGCCGTCTATACCGTTGTACCGCACCTGTC^ 

GCCAAAAGCCTGCAATCCTTCCGCCTCGGCTGCGCCAATTTGAAAAACCGC 

GCCTTTCAAACCCCCGTCCATTCCTTTCAGGCAAAA^ 

AACGGAAGCCTTGCCGGTACGGTTACCGGCTATTACGAGCCGGTGCTGAAGGGCC^ 

CGCTTCCCGATTTACGGTATTCCCGACGATTTTATCTCCGT^ 

GTCCGCATCAGGCAGACGGGAAAAAACAGCGGCACAATCGACAATACCGGCGGC^ 

TTCCCCATCACCGCGCGCACAACGGCAATCAAAGGCAGGTTTGAAG 

CAAATCAACGGCGGCGCGCTTGACGGCAAAGCCCCGATACT 

CACATCC^GGCTCGGGCCGTCTGAAAACCCCGTCCGGCAAATACATCCGC^ 

CCCTACGTTTCC^TCGGACGCTATATGGCGGACAAAGGCTACCTCAAG 

GCCTATATGCGGCAAAATCCGCAACGCCTCGCCGAAGTTTTGGOT 

GCCGGAAGCAGCAATGACGGTCCCGTCGGCGCACTGGGCACGCCGTTGATO 

CACTACATTACCTTGGGCGCGCCCTTATTTGTCGCCACCGTCCCATCCGGTTAC 

ATGGCGCAGGATACCGGCAGCGCGATTAAAGGCGCGGTGCGCGTGGATTATTTO 

GAACTTGCCGGCAAACAGAAAACCACGGGTTACGTCTGGCAGCTCCTACCCAACGGTAT 

TAACTCGAG 

1 MASPDVKSAD TLSKPAAPW AEKETEVKED APQAGSQGQG APSTQGSQDM [ 
51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS 
101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL \ 
151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVT 
201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 
251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE ; 
301 VLHFHTBNGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 
351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 
401 KKEQDGSGGG GCQSKSIQTF PQPDTSVING PDRFVGIPDP AGTTVGGGGA 
451 VYTWPHLSL PHWAAQDFAK SLQSFRLGCA NLKNRQGWQD VCAQAFQTPV 
501 HSFQAKQFFB RYFTPWQVAG NGSLAGTVTG YYEPVLKGDD RRTAQARFPI 
551 YGIPDDFISV PLPAGLRSGK ALVRIRQTGK NSGTIDNTGG THTADLSRFP 
601 ITARTTAIKG RFEGSRFLPY HTRNQINGGA LDGKAPILGY AEDPVELFFM 
651 HIQGSGRLKT PSGKYIRIGY ADKNEHPYVS IGRYMADKGY LKLGQTSMQG 
701 IKAYMRQNPQ RLAEVLGQNP SYIFFRELAG SSNDGPVGAL GTPLMGEYAG 
751 AVDRHYI TLG APLFVATAHP VTRKALNRLI MAQDTGSAIK GAVRVDYFWG 
801 YGDEAGELAG KQKTTGYVWQ LLPNGMKPEY RP* 



AG2 87—953 

ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAAAACCGTC 

GTAAAAGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCGCGCCATCCACACAA 

GTTTCGGCAGAAAATACAGGCAATGGCGGTGCGGCAAGAACGGA^^ 

GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAG 

GCGTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTGGAAGGGTT^ 

CCGTCGCAAAATATAACGTTGACCCACTGTAAAGGCGATTCTTGTAATGGTGATAATTT^ 

TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACGAATTGAGAAATATM 

ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTAACAAATATC 

TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTC 

CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCA 

AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCGGCGGATCGTATGCCCTC 

AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAAGTGCTGCATTTT 

CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGATTTCGGCAGCAAATC 

TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAACGGCTOT 

GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGGCGGG 

GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATGGATCC 

GACGAATATCACGCCAACGCCCGTTTCGCCATCGACCATTTCAACACCAGCACCAA^ 

ACCGGTTCCGTCGAGTTCGACCAAGCAAAACGCGACGGTAAAATCGACATCACCATCCCCGTTGCCAAC 

GGTTCGCAACACTTTACCGACCACCTGAAATCAGCCGACATCTTCGATGCCGCCCAA 

TCCACCAAATTCAACTTCAACGGCAAAAAACTGGT1TCCGTTGACGGCAACCTGACCA 

GTCAAACTGAAAGCCGAAAAATTCAACTGCTACCAAAGCCCGATGGC^ 

ACCACC^TCGACCGCACGAAATGGGGCGTGGACTACCTCGTTAAC^ 

CAAATCGAGGCAGCCAAACAATAACTCGAG 

1 MASPDVKSAD TLSKPAAPW AEKETEVKED APQAGSQGQG APSTQGSQDM 

51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS 

101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 

151- LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI 
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201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 
251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 
301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 
351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 
5 401 KKEQDGSGGG GATYKVDEYH ANARFAIDHF NTSTNVGGFY GLTGSVEFDQ 

451 AKRDGKIDIT IFVANLQSGS QHFTDHLKSA DIFDAAQYPD IRFVSTKFNF 
501 NGKKLVSVDG NLTMHGKTAP VKLKAEKFNC YQSPMAKTEV CGGDFSTTID 
551 RTKWGVDYLV NVGMTKSVRI DIQIEAAKQ* 

10 

AG287-961 

ATGGCTAGCCCCGATGTTAAATCGGCGGACACGCTGTCAA 

GTAAAAGAAGATGCGCCACAGGCAGGTTCTGAAGGACAGGGC 

GTTTCGGCAGAAAATACAGGC^TGGCGGTCCGGCAACAACGGACAAACC 
15 GATATGCCGCAAAATTCCGCCGAATCCGCAAATCAAACAGGGAACAACCAACCCGCCGAOT 

(^GTCAAACCCTGCACCTGCGAATGGCGGTAGCAATTTTG 

CCGTCGCAAAATATAACGTTGACCCACTGTAAA^ 

TCAAAATCAGAATTTGAAAATTTAAATGAGTCTGAACG 

ACTAATTTGGTTGCGACAGCAGTTCAAGCTAATGGAACTA^ 
20 TCTTCATCTGCGCGATTCAGGCGTTCTGCACGGTCGAGGAGGTCGCTTCCTGCCGAGA 

CAGGCGGATACGCTGATTGTCGATGGGGAAGCGGTCAGCCTGACGGGGCATTCCG 

AATTACCGGTATCTGACTTACGGGGCGGAAAAATTGCCCG^ 

AAAGGCGAAATGCTTGCTGGCACGGCCGTGTACAACGGCGAA 

CCGACTAGAGGCAGGTTTGCCGCAAAAGTCGAT^ 
25 TTGCATATGGGTACGCAAAAATTCAAAGCCGCCATCGATGGAAAC 

GGGGATGTTTCCGGAAGGTTTTACGGCCCGGCCGGCGAGGAAGTGG^ 

GAAAAGGGCGGATTCGGCGTGTTTGCCGGCAAAAAAGAGCAGGATG 

GATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCT 

GAGACCATCTACGACATTGATGAAGACGGCAGAATO 
30 TTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATC 

AAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTA 

GATGCCGCTCTGGATGCAACCACCAACGCCTTGAAT^ 

ACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAG^ 
ATCGCCGATTCATTGGATGAAACCAAC^CTAAGGCAGACGAAGCCGTCAAAACCGCCAATC 
35 GAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTC 
ACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAM 

AAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTCTACACCAGAGAAGAGTCTGAC^ 
GATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTC 
CGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACC 
40 TCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTA 
GTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTG^ 
TCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGTAACTCGAG 

1 MASPDVKSAD TLSKPAAPW AEKETEVKED APQAGSQGQG APSTQGSQDM 

45 51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS 

101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 

151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI 

201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 

251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 

50 301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 

351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 

401 KKEQDGSGGG GATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE 

451 DGTITKKDAT AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE 

501 SEIEKLTTKL ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV 

55 551 KIDEKLEAVA DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE 

601 TKQNVDAKVK AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN 

651 KDNIAKKANS ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH 

701 DTRLNGLDKT VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYK 

751 SESAVAIGTG FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEW* 

60 
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ELISA 


Bactericidal 


AG287-953-His 


3834 


65536 


AG287-961-His 


108627 


65536 



The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins 
was compared with antibodies raised against simple mixtures of the component antigens 
(using 287-GST) for 919 and ORF46.1: 





Mixture with 287 


Hybrid with AG287 


919 


32000 


128000 


ORF46.1 


128 


16000 



Data for bactericidal activity against heterologous MenB strains and against serotypes A and 
C were also obtained: 





919 


ORF46.1 


Strain 


Mixture 


Hybrid 


Mixture 


Hybrid 


NGH38 


1024 


32000 




16384 


MC58 


512 


8192 




512 


BZ232 


512 


512 






MenA (F6124) 


512 


32000 




8192 


MenC (Cil) 


>2048 


>2048 






MenC (BZ133) 


>4096 


64000 




8192 



The hybrid proteins with AG287 at the N-terminus are therefore immunologically superior to 
simple mixtures, with AG287-ORF46.1 being particularly effective, even against 
heterologous strains. AG287-ORF46. IK may be expressed in pET-24b. 

The same hybrid proteins were made using New Zealand strain 394/98 rather than 2996: 

AG287NZ-919 

ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTCAAAACCTGCCGCCC^ 
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGACAGGGCG^ 
GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAAC 
GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACAC^ 

ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATAT^ 

ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGG^ 

AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGCGGT 

ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCG 

AGTGGCAATAATTTCTTGGATGAAGAAGTACAGCTAAAAT 

AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTO 

GGAATCAATCAATATATTATCTTTTATAAACCTAA^ 

CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTGTC 

CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACG 

GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAA 

GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCX3TCCAGAGGCAGG 

AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACG 

GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATGTTTCCGGAA^ 
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GAAGTGGCGGGAAAATAGAGCTATCGCCCAACAGATGCG 
CAGGATGGATCCGGAGGAGGAGGATGCCAAAGCAAGAGCATCCAAACCTTTC 
GGCCCGGACCGGCCGGTCGGCATCCCCGACCCCGCCGGAACGACGGTCGGCGGCGGCGGGGCCGT 
CCGCACCTGTCCCTGCCCCACTGGGCGGCGCAGGATTTCGCCAAAAGCCTGCA^ 
5 TTGAAAAACCGCCAAGGCTGGCAGGATGTGTGCGCCCAAC^ 
TTTTTTGAACGCTATTTCAOX:CGTGGCAG^ 
CCGGTGCTGAAGGGCGACGACAGGCGGACGGCACAAGCCCGCTTC 
GTCCCCCTGCCTGCCGGTTTGCGGAGCGGAAAAGCCCTTGTCCGCATCAGGC 
GACAATACCGGCGGCACACATACCGCCGACCTCTCCCGA 
10 TTTGAAGGAAGCCGCTTCCTCCCCTACCACACGCGCAAC 

CTCGGTTACGCCGAAGACCCCGTCGAACTTTTTTTTATGCACATC 

AAATACATCCGCATCGGCTATGCCGACAAAAACGAACATCCCTACGTTTCCATCGGACGC 
TACCTCAAGCTCGGGCAGACCTCGATGCAGGGC^^ 

TTGGGTCAAAACCCCAGCTATATCTTTTTCCGCGAGCTTGCCGGAAGCAGCAATGACGGTC 
1 5 ACGCCGTTGATGGGGGAATATGCCGGCGCAGTCGACCGGCACTACATTACCTTGGGCG 

GCCCATCCGGTTACCCGCAAAGCCCTCAACCGCCTGATTATGGCGCAGGATACCGGCAGCGCGATTAAA 

CGCGTGGATTATTTTTGGGGATACGGCGACGAAGCCGGCGA 

CAGCTCCTACCCAACGGTATGAAGCCCGAATACCGCCCGTAAAAGCTT 

20 1 MASPDVKSAD TLSKPAAPW SEKETEAKED APQAGSQGQG APSAQGGQDM 

51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM 

101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 

151 TNQAENNQTA GSQNPASSTN PSATNSGGDP GRTNVGNSW IDGPSQNITL 

201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 

25 251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ 

301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 

351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 

401 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 

451 RPTDAEKGGF GVFAGKKEQD GSGGGGCQSK SIQTFPQPDT SVINGPDRPV 

30 501 GIPDPAGTTV GGGGAVYTW PHLSLPHWAA QDFAKSLQSF RLGCANLKNR 

551 QGWQDVCAQA FQTPVHSFQA KQFFERYFTP WQVAGNGSLA GTVTGYYEPV 

601 LKGDDRRTAQ ARFPIYGIPD DFISVPLPAG LRSGKALVRI RQTGKNSGTI 

651 DNTGGTHTAD LSRFPITART TAIKGRFEGS RFLPYHTRNQ INGGALDGKA 

701 PILGYAEDPV ELFFMHIQGS GRLKTPSGKY IRIGYADKNE HPYVS IGRYM 

35 751 ADKGYLKLGQ TSMQGIKAYM RQNPQRLAEV LGQNPSYIFF RELAGSSNDG 

801 PVGALGTPIiM GEYAGAVDRH YITLGAPLFV ATAHPVTRKA LNRLIMAQDT 

851 GSAIKGAVRV DYFWGYGDEA GELAGKQKTT GYVWQLLPNG MKPEYRP* 

40 AG287NZ-953 

ATGGCTAGCCCCGATGTCAAGTCGGCGGACACGCTGTC 
GCAAAGGAAGATGCGCCACAGGCAGGTTCTCAAGGAG&GGGCGC 
GTTTCGGAAGAAAATACAGGCAATGGCGGTGCGGCAGCAACGGACAAACCC^ 
GATATGCCGCAAAATGCCGCCGATACAGATAGTTTGACACCGAATC 

45 ATGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCCGGCAAACCAACCGGATATC 
ATGCAGGGTGACGATCCGTCGGCAGGCGGGGAAAATGCCGGCAATACGGCTGCCCAA 
AATCAAACCGCCGGTTCTCAAAATCCTGCCTCTTCAACCAATCCTAGCGCCACGAATAGC 
ACGAACGTGGGCAATTCTGTTGTGATTGACGGGCCGTCGCAAAATATAACGTTGACCCACTC 
AGTGGCAATAATTTCTTGGATGAAGAAGTAC^GCTAAAATCAGAATTTGAAAAATTAAGTGAT^ 

50 AATTACAAGAAAGATGGGAAGAATGACGGGAAGAATGATAAATTTGTC 
GGAATGAATCAATATATTATCTTTTATAAACCT^ 

CGGTCGCTTCCGGCCGAGATGCCGCTGATTCCCGTCAATCAGGCGGATACGCTGATTC 
CTGACGGGGCATTCCGGCAATATCTTCGCGCCCGAAGGGAATTACCGGTATCTGACTTACGGGGCGG 
GGCGGATCGTATGCCCTCCGTGTTCAAGGCGAACCTTCAAAAGGCGAAATGCTCGCGGGC^ 
55 GAAGTGCTGCATTTTCATACGGAAAACGGCCGTCCGTCCCCGTCC^ 

AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCATATGGGTACG 
GGAAACGGCTTTAAGGGGACTTGGACGGAAAATGGCGGCGGGGATG 
GAAGTGGCGGGAAAATAGAGCTATCGCCCAACAGATGCGGAAAAGGGCGG^ 
CAGGATGGATCCGGAGGAGGAGGAGCCACCTACAAAGTGGACGAATATCACGC 

60 TTCAACACCAGCACCAACGTCGGCGGTTTTCACGGTCTGACCGG 

AAAATCGACATCACCATCCCCGTTGCCAACCTGCAAAGCGGTTCGCAACACTTTACCGA 

ATCTTCGATGCCGCCCAATATCCGGACATCCGCTTTGTTTC 
GTTGACGGCAACCTGACCATGCACGGCAAAACCGCCCCCGTCAAACTC 
CCGATGGCGAAAACCGAAGTTTGCGGCGGCGACTTCAGCACCACCATCGACC 
65 GTTAACGTTGGTATGACCAAAAGCGTCCGCATCGACATCCAAATCGAGG^ 

1 MASPDVKSAD TLSKPAAPW SEKETEAKED APQAGSQGQG APSAQGGQDM 
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10? ^^f TCN GGAAATDKPK NEDEGAQNDM PQNAftDTDSL TPNHTPASNM 
HI ^^ QAP ^SEQPAN QPDMANTADG MQGDDPSAGG SSSSS 
"J GSQNPASSTN PSATNSGGDF GRTNVGNSW IDgS 

HI !fSn^ ^ 1EE ^ L KSEFE ^SDA DKISNYKKDG KNDGKNdSS 
3oJ f^^H MK GINQYIIFYK PKp TSFARFR RSARSRRSLP ABMPLIPVNQ 
301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG S^RVOGEP 
35! SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SvSSS 
rdtosto^ KGTWTENGGG DVsSfySa SSSS 

, I ™ EKGGF GVFAGKKEQD GSGGGGATYK VDEYHANARF AIdSSsto 
VGGFYGLTGS VEFDQAKRDG KIDITIPVAN LQSGSQHFTD HLkSdtS 
ESSF* 8 TKFNFNGKKL VSVDGNLTMH StapSSS S2 
AKTEVCGGDF STTIDRTKWG VDYLVNVGMT KSVRIDIQIE AAKQ* 



401 
451 
501 
551 
601 



AG287NZ-961 



GTTTCTCAAGAAAATACAGtt^AT^ 
A TGGAAAACCAAGCACCGGATGCCGGGGAATCGGAGCAGCC 



agtggcaataatttcttgSSS 
ctgacgggg^ttccggSISS^ccgaaS 

AGCAAATCTGTGGACGGCATTATCGACAGCGGCGATGGTTTGCA' 



a^oacgcaactg^^ 



51 
101 
151 
201 
251 
301 



^f™^ TLSKPAAPW SEKETEAKED APQAGSQGQG APSAQGGQDM 
AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHtSsNM 
PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG SSS 
TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSW IDGPSQNITL 
™^ SCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 
^^S^ GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ 
VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 
III r^™ GTA ^^H™ TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 
J?} S™ 1 * ^AAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 
til raJm^^ GVFAGKKEQD GSGGGGATND DDVKKAATVA IAAAYNNGQE 
501 INGFKAGETI YDIDEDGTIT KKDATAADVE ADDFKGLGLK KWTNLTKTV 
f 0! LTTKLADTDA ALADTDAaS 

S? ^f^ ET ^tntvkidek leavadtvdk haeafndiad sldetotkad 

ill QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA 

III y^fyT? IKA DIATNKENIA kkansadvyt reesdskpvr IDGLNATTEK 

finJ KT^^f^ SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY 
Ml VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY 
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Example 3 - hybrids ofAG983 
Protein 983 has the following sequence: 

983 + AG983 

1 MRTTPTFPTK TFKPTAM&LA VATTLSAC LG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN 

101 LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT 

201 DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM 

251 MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL 

301 DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA 

351 LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA 

401 MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN 

451 LRTTLLTTAQ DIGAVGVDSK FGWGLLDAGK AMNGPASFPF GDFTADTKGT 

501 SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN 

551 KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ 

601 LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS 

651 AAKIGQDYSF FTNIETDGGL LASLDSVEKT • AGSEGDTLSY YVRRGNAART 

701 ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT 

751 DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG 

801 RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA 

851 KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIRHD AGDIGYLKGL 

901 FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE 

951 GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL 

1001 FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE 

1051 FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF* 

AG983 thus has the following basic sequence: 

TSAPD FNAGGTGIGS 

NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPPN 
LHTGDFPNPN DAYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
LYGRKEHGYN ENYKNYTAYM RKEAPEDGGG KDIEASFDDE AVIETEAKPT 
DIRHVKEIGH IDLVSHIIGG RSVDGRPAGG IAPDATLHIM NTNDETKNEM 
MVAAIRNAWV KLGERGVRIV NNSFGTTSRA GTADLFQIAN SEEQYRQALL 
DYSGGDKTDE GIRLMQQSDY GNLSYHIRNK NMLFIFSTGN DAQAQPNTYA 
LLPFYEKDAQ KGIITVAGVD RSGEKFKREM YGEPGTEPLE YGSNHCGITA 
MWCLSAPYEA SVRFTRTNPI QIAGTSFSAP IVTGTAALLL QKYPWMSNDN 
LRTTLLTTAQ DIGAVGVDSK FGWGLLDAGK AMNGPASFPF GDFTADTKGT 
SDIAYSFRND ISGTGGLIKK GGSQLQLHGN NTYTGKTIIE GGSLVLYGNN 
KSDMRVETKG ALIYNGAASG GSLNSDGIVY LADTDQSGAN ETVHIKGSLQ 
LDGKGTLYTR LGKLLKVDGT AIIGGKLYMS ARGKGAGYLN STGRRVPFLS 
AAKIGQDYSF FTNIETDGGL LASLDSVEKT AGSEGDTLSY YVRRGNAART 
ASAAAHSAPA GLKHAVEQGG SNLENLMVEL DASESSATPE TVETAAADRT 
DMPGIRPYGA TFRAAAAVQH ANAADGVRIF NSLAATVYAD STAAHADMQG 
RRLKAVSDGL DHNGTGLRVI AQTQQDGGTW EQGGVEGKMR GSTQTVGIAA 
KTGENTTAAA TLGMGRSTWS ENSANAKTDS ISLFAGIRHD AGDIGYLKGL 
FSYGRYKNSI SRSTGADEHA EGSVNGTLMQ LGALGGVNVP FAATGDLTVE 
GGLRYDLLKQ DAFAEKGSAL GWSGNSLTEG TLVGLAGLKL SQPLSDKAVL 
FATAGVERDL NGRDYTVTGG FTGATAATGK TGARNMPHTR LVAGLGADVE 
FGNGWNGLAR YSYAGSKQYG NHSGRVGVGY RF* 

AG983 was expressed as a hybrid, with ORF46.1, 741, 961 or 961c at its C-terminus: 

AG983-ORF46.1 

ATGACTTCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGC 
GCAGTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTC 
GTTA(^GACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGA 
TACAAGAATTTGATC^CCTCAAACCTGCAATT^ 

ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAA 

AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAG^ 

GCCGTTATAGAGACTGAAGGAAAGCCGACGGATATCCGCCACGTAAAA 
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ATTATTGGCXX3GCGTTCCGTGGACGGCAGA 

AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGC^ 

ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGAC 

TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAG 
5 GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCG 

AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCT^ 

GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAG 

ACIWCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTC 

ACATCCTTTTCCGCACCCATCGTAACCGGCAC 
1 0 CTGCGTACCACGTTGCTGACGACX3GCTCAGGACATCGGTGGAGTC 

GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTO 

ATTGCCTACTCCTTCCGTAACGACATTTC^GGCACGGGCGGCCTGAT 

GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCX^ 

CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCC 
15 CTGGCAGATACCGACCAATC CGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGC 

CTGTACA.CA.CGTTTGGGCAAACTGCTGAAAGTGGACGGT^ 

GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTC 

TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTC 

GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGC^ 
20 GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCT^ 

GCAACACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGC^ 

CGCGCAGCGGCAGCCGTACAGCATGOSAATGCCGCCG 

GCCGACAGTACCGCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATC 
ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAA 
25 GGCAGTACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAG^ 
AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCAOT 
GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCT 
CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCT 

GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGC^ 
30 GGCTCGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTC 

AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGT 
GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCG 
GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCA 
CGAGTCGGCGTAGGCTACCGGTTCCTCGACGGTGGCGGAGGCACT^ 
35 CGGCAGGTTCTCGACCGTCAGCATTTCGAACCCGACGGGAAATAC^ 
CGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCAT 

GGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCCT^ 
TCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGATTO 

CCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAGGGATATATACA 
40 ATAAAAGGCGTTGCCCAAAATATCCGCCTGAACC 

CACAATGCCGGTAGTATGCK5ACGCAAGGAGTAGGCGACGGATTCAAACGCGCCACC 

AGATCGGGCAATGCCGCCGAAGCCTTCAAOSGCACT^ 

GTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACATTC 

ACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGA 
45 GATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTCAGCAATATCTTTAT 

AAAGGGATTGGAGCTGTTCGGGGAAAATACGGC™ 

GCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATTTTGCCGATGC 
TACCATTCCCGAAATATCCX3TTGAAACTTGGAGCAGCGCT 

TCAAACGGCAAAAATGTCAAACTGGCAGACGAACGCCACCCGAAGACAGGCGTACCGOT 
50 AATTTTGAGAAGCACGTGAAATATGATACGCTCGAGCACCACCACCACCACCACTGA 

1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 

51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVBV 

101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE 

55 151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD 

201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 

251 LFQIANSEEQ YRQALLDYSG GDKTDBGIRIi MQQSDYGNLS YHIRNKNMLF 

301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 

351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 

60 401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 

451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 

501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 

551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 

601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 

65 651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE 

701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA 

751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG 
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801 VBGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 

851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL 

901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 

951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR 

1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLD 

1051 GGGGTGSSDL ANDSFIRQVL DRQHFEPDGK YHLFGSRGEL AERSGHIGLG 

1101 KIQSHQLGNL MIQQAAIKGN IGYIVRFSDH GHEVHSPFDN HASHSDSDEA 

1151 GSPVDGFSLY RIHWDGYEHH PADGYDGPQG GGYPAPKGAR DIYSYDIKGV 

1201 AQNIRLNLTD NRSTGQRLAD RFHNAGSMLT QGVGDGFKRA TRYSPELDRS 

1251 GNAAEAFNGT ADIVKNIIGA AGEIVGAGDA VQGISEGSNI AVMHGLGLLS 

1301 TENKMARIND LADMAQLKDY AAAAIRDWAV QNPNAAQGIE AVSNIFMAAI 

1351 PIKGIGAVRG KYGLGGITAH PIKRSQMGAI ALPKGKSAVS DNFADAAYAK 

1401 YPSPYHSRNI RSNLEQRYGK ENITSSTVPP SNGKNVKLAD QRHPKTGVPF 

1451 DGKGFPNFEK HVKYOTLEHH HHHH* 



AQ983-741 

ATGACT^CTGCGCCCGACTTCAATGCAGGCGG 

GCAGTATCTTAGGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTC 

GTTACAGAGAGGGATGCC^AAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTO 

TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCT^^ 

ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACG^ 

AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCOT 

GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTC 

ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCT 

AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGOT 

ATCGTCAATAACAGTTTTGGAAGAACATCGAGGG 

TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGA 

GGCAACCTGTCCTACCAGATCCGTAATAAAAACATGCTTTTCATCTTTTC 

AACACATATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATT 

GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGG^ 

ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGA 

ACATCCTTTTCCGCACCCATCGTAACCGGGACGGCGGCTCTGCTGCTC 

CTGCGTACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAG 

GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCG 

ATTGCCTACTCCTTCCGTAACGAGATTTCAGGGACGGGCGGCCTGATCAAAAAAGGC 

GGCAACAACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTC 

CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATC 

CTGGCAGATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAGTCTGCAGCT^ 

CTGTAC^CACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATO 

GGCAAGGGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCC^ 

TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTC 

GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGG^ 

GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAAC 

GCAAGACCCGAGACGGTTGAAACTGCGGCAGCCGACCGCAGAGATATGCCGGGCATCCGCCCC 

CGCGCAGCGGCAGCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTC 

GCCGAGAGTACCGCCGCCCATGCCGATATGCAGGGACGCC^ 

ACGGGTCTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAC^ 

GGCAGTACCCAAACCGTCXSGCATTGCCGCGAAAACCGGCGAAAATACGACAGCA^ 

AGCACATGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTC 

GATATCGGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGC 

CATGCGGAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTC 

GGAGATTTGACGGTCGAAGGCGGTCTGC^ 

GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTC^ 

AAAGCCGTCCTGTTTGCAkCGGCGGGCGTGGAACGCGACCTGAAC 

GGCGCGACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCT^ 

GTCGAATTCGGCAACGGCTCGAACGGCTTGGCACGTTACAGCT^^ 

CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGATCCGGAGGGGGTGGTC 

GATGCACTAACCGCACCGCTCGACGATAAAGACAAAGGTTTGCAGTCTT 

GAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGAAACGGTG^ 

AACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAA^ 

TTCCAAGTATACAAACAAAGCCATTCCGCCT^ 

AAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACATC 

GGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGC CGGCGGAAAACTGACCTACAC C ATAGATTTCGCC 

GCCAAG<^GGGAAACGGCAAAATCGAACATTTGAAATCGCCAGAACTCAATGTCGA 

CCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCCOT 
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ATCTTTGGCGGAAAAGCCCAGGAA 
GCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 

1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 

5 51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV 

101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE 

151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD 

201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 

in 251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF 

1U 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 

351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 

401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 

451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 

- 501 GKTIIEGGSL VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 
15 551 DQSGANBTVH IKGShQhDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 

601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 

651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE 

701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA ; 

on 751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVTAQTQ QDGGTWEQGG / 

*V 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 

851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL ' 

901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 

951 LAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA taatgktgar 

- 1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE - 
^ 1051 GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ 

1101 GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ 

1151 SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT 

1201 YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD 

1251 GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL 

30 1301 AAKQLEHHHH HH* 

AG983-961 

ATGACTOCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGC 
35 <^GTATCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTC 

GTTACAGACAGGGATGCC AAAATCAATGCCCCCCC CCCGAATCTGCATAC CGGAGACTTTCC AAAC C C AAATGACGCA 
TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAG^ 

ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAAC 

AACTATACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGCT 
40 GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAA^ 

ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGAT^ 

AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATG^ 

ATCGTCAATAACAGTTTTGGAACAAGATCGAGGGCAGGCACTGCCG^ 

TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAA^ 
45 GGCAACCTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGC 

AACAGATATGCCCTATTGCCATTTTATGAAAAAGACGOT 

GGAGAAAAGTTCAAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTCAGTATGGCTCCAACC^ 
ACTGCCATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCAC 
ACATCCTTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTG^ 
50 CTGCGTACCACOTTGCTGACGACGGC^ 

GATGCGGGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAA^ 
ATTGCCTACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTC 

GGCAACAACAC CTATAC GGGC AAAAC C ATTATCG AAGGC GGT TCGC TGGTGTTGTAC GGC AAC AACAAATC GGAT ATG 
CGCGTCGAAACCAAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACA 
55 CTGGCAGATACCGAC CAATCCGGC GCAAACGAAACCGT ACACATCAAAGGCAGTCTGCAGC TGGACGGCAAAGGTACG 

CTGTACACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGC 
GGCAAGGGGGCAGGCTATCTCAAGAGTACCGGACGACGTC^ 
TCTTTCTTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTC 

GGCGACACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATO 
O0 GGTCTGAAACACGCCGTAGAACAGGGCGGCAGCA ATPTrtH A A A ArrrtriArrr^n^n^7,nrT^riT,mrLnnwn^rtTiT,m^m^^ 
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CATGCGGAAGGCAGCGTCAACGGCACGCTGATCCAGC 
GGAGATTTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAACAGG 
GGCTGGAGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAG^ 
AAAGCCGTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGG 

GTCGAATTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTT 

CGAGTCGGCGTAGGCTACCGGTTCCTCGAGGGTGGCGGAGGCAC 

GCTGCGACTGTGGCGATTGCTGCTGCCTACAACAA 

GACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACTGC^ 
10 GGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATGAAAACAA^^ 

GCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGAC^ 

GATGCAACGACCAACGCCTTGAATAAATTGGGAGAAAATATAAC 

AAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGAG^^ 

TTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAAC^ 
1 5 CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGCAGCAGGCAAAGCCGAAGCTGCOT 

GCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACATCAAAGCTGATATCGC 

GCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCT^ 

GCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCCAT 

TTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGC 
20 CAACCTTACAACGTGGGTCGGTTCAATGTAACGGCTGCAGTCGGCGGCTACAAATC 

ACCGGCTTCCGCTTTACCGAAAACTTTGCCGCCAAAGCAGGCGTGGCAGTCG 

TACCATGTCGGCGTCAATTACGAGTGGCTCGAGCACCACCACCAC CAC CACTGA 

1 MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDD 

25 51 VAVTDRDAKI NAPPPNLHTG DFPNPNDAYK NLINLKPAIE AGYTGRGVEV 

101 GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGGKDIE 

151 ASFDDEAVIE TEAKPTDIRH VKEIGHIDLV SHIIGGRSVD GRPAGGIAPD 

201 ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 

251 LFQIANSEEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS YHIRNKNMLF 

30 301 IFSTGNDAQA QPNTYALLPF YEKDAQKGII TVAGVDRSGE KFKREMYGEP 

351 GTEPLEYGSN HCGITAMWCL SAPYEASVRF TRTNPIQIAG TSFSAPIVTG 

401 TAALLLQKYP WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 

451 PASFPFGDFT ADTKGTSDIA YSFRNDISGT GGLIKKGGSQ LQLHGNNTYT 

501 GKTIIEGGSL VLYGNNKSDM RVETKGAL I Y NGAASGGSLN SDGIVYLADT 

35 551 DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSARGK 

601 GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL DSVEKTAGSE 

651 GDTLSYYVRR GNAARTASAA AHSAPAGLKH AVEQGGSNLE NLMVELDASE 

701 SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGVRIFNSLA 

751 ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLRVIAQTQ QDGGTWEQGG 

40 801 VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA NAKTDSISLF 

851 AGIRHDAGDI GYLKGLFSYG RYKNSISRST GADEHAEGSV NGTLMQLGAL 

901 GGVNVPFAAT GDLTVEGGLR YDLLKQDAFA EKGSALGWSG NSLTEGTLVG 

951 IiAGLKLSQPL SDKAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR 

1001 NMPHTRLVAG LGADVEFGNG WNGLARYSYA GSKQYGNHSG RVGVGYRFLE 

45 1051 GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDIDEDGTI 

1101 TKKDATAADV EADDFKGLGL KKWTNLTKT VNENKQNVDA KVKAAESEIE 

1151 KLTTKLADTD AALADTDAAL DATTNALNKL GENITTFAEE TKTNIVKIDE 

1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN 

1251 VDAKVKAAET AAGKAEAAAG TANTAADKAE AVAAKVTDIK ADIATNKDNI 

50 1301 AKKANSADVY TREESDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTRL 

1351 NGLDKTVSDL RKETRQGLAE QAALSGLFQP YNVGRFNVTA AVGGYKSESA 

1401 VAIGTGFRFT ENFAAKAGVA VGTSSGSSAA YHVGVNYEWL EHHHHHH* 

55 AG983-961C 

ATGACTTCTGCGCCCGACTOGAATGCAGGCGGTACCG 

GCAGTATCTTACGCCGX3TATCAAGAACGAAATGTC 

GTTACAGACAGGGATGCCAAAATCAATGCCCCCCCCCCG^ 

TACAAGAATTTGATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACG^ 
60 ACAGGCGAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAAC 

AACTATACGGCGTATATGCGGAAGGAAGCGCCT^ 

GCCGTTATAGAGACTGAAGCAAAGCCGACGGATATC^ 

ATTATTGGCGGGCGTTCCGTGGACGGCAGACCTGCAGGC^ 

AATGATGAAACCAAGAACGAAATGATGGTTGCAGCCATC^ 
65 ATCGTCAATAACAGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTO 

TACCGCCAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCG^ 

GGCAACCTGTCCTACQACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAAT 
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ACTGCCATGTGGTGCCTGTCGGCACCCTATG^GCAA 

acatccttttcc*^^ 

^^^^^^^^^^^^^ 

CTGGCAGATACCGACCAATCCGGCGC^^ 



ACGGGTCTGCGCGTCATCGCGGAAACCCAACAGGAC^ 

agcacatggagcgaaaacagt^tg^Sa^cc 

GATATCGGCTATCTCAAAGGCcW 

^^^^^^^^^^^^^^^ 

o^gactccagcaS 
cgagtcggcgtaggctaccggttcctcga^ 




1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 



MTSAPDFNAG GTGIGSNSRA TTAKSAAVSY AGIKNEMCKD RSMLCAGRDn 
NAPPPNLHTG »™DAYK NLlSpS SESSE! 

GIVDTGESVG SISFPELYGR KEHGYNENYK NYTAYMRKEA PEDGGgSJS 
TEAKPTDIRH VKEIGHIDLV SHIIGGRsS gSSSpD 

ATLHIMNTND ETKNEMMVAA IRNAWVKLGE RGVRIVNNSF GTTSRAGTAD 

LFQIANSBEQ YRQALLDYSG GDKTDEGIRL MQQSDYGNLS ££SS£? 

SSSSl SEES r^ 011 tSwSSS SkJeSp 

SS^f^S S5i ITAMWCL SAPYEASVRP TRTNPIQIAG TSFSAPXVTC 
p^£^™ WMSNDNLRTT LLTTAQDIGA VGVDSKFGWG LLDAGKAMNG 
SSSS^ ^ GTSDIA YSF ^ISGT GGLIKKGGSQ LQlSS 
SS^S?* VLYGNNKSDM RVETKGALIY NGAASGGSLN SDGIVYLADT 
DQSGANETVH IKGSLQLDGK GTLYTRLGKL LKVDGTAIIG GKLYMSArS 
GAGYLNSTGR RVPFLSAAKI GQDYSFFTNI ETDGGLLASL dSSSS 

gdtlsyyvrr gnaartasaa ahsapaglkh aveqggsnle nlmSS 

SSATPETVET AAADRTDMPG IRPYGATFRA AAAVQHANAA DGTOIFNsS 
ATVYADSTAA HADMQGRRLK AVSDGLDHNG TGLrSSSJ SSSSg* 
VEGKMRGSTQ TVGIAAKTGE NTTAAATLGM GRSTWSENSA 

AGIRHDAGDI gylkglfsyg ryknsisrst gadehaegsv S£S£ 

S^fc™ ^ TVBGGLR YDLLKQDAFA EKGSALGWSG S2£S£ 
IiAGLKLSQPL SDRAVLFATA GVERDLNGRD YTVTGGFTGA TAATGKTGAR 
^PHTRLVAG LGADVEFGNG WNGLARYSYA GSRQYGNhS R^G^ySS 
GGGGTGSATN DDDVKKAATV AIAAAYNNGQ EINGFKAGET IYDiSStI 

™ SSES KKVV ™ L ^ v«wSS SSaSSS 

KLTTKLADTD AALADTOAAL DATTNALNKL GENITTFAEE TKTNIVKIDE 
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1201 KLEAVADTVD KHAEAFNDIA DSLDETNTKA DEAVKTANEA KQTAEETKQN 

1251 VDAKVKAAET AAGKAEAAAG TANTAADKAE AVAAKVTDIK ADIATNKDNI 

1301 AKKANSADVY TREESDSKFV RIDGLNATTE KLDTRLASAE KSIADHDTRL 

1351 NGLDKTVSDL RKETRQGLAE QAALSGLFQP YNVGLEHHHH HH* 



Example 4 - hybrids ofAG741 
Protein 741 has the following sequence: 



1 VNRTAFCCLS LTTALILTA C SSGGGGVAAD IGAGLADALT APLDHKDKGL 

51 QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ 

101 IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI 

151 GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI 

201 EHLKSPELNV DLAAADIKPD GKRHAVTSGS VLYNQAEKGS YSLGIFGGKA 

251 QEVAGSAEVK TVNGIRHIGL AAKQ* 



AG741 thus has the following basic sequence: 

VAAD IGAGLADALT APLDHKDKGL 
QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ 
IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI 
GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI 
EHLKSPELNV DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA 
QEVAGSAEVK TVNGIRHIGL AAKQ* 

AG741 was fused directly in-frame upstream of proteins 961, 961c, 983 and ORF46.1: 

AG741-961 

ATGGTCGCCGCCGACATCGGTGCGGGGCOTGCCGATGCACT 
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAAC^ 

AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTT^ 
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAA 
ACCGAGCAAATAOVAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTT 
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATC 

GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATT 

CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATC^ 

AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCA 

AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGTGGCG^^ 

AACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTG^^ 

AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCAACT^ 

GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAA 

GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTO 

GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGA^ 

GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTG 

TTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATGAAGCCA 

CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCA 

GCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCG 

GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAG 

GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGC 

(^CGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCC 

GCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATC 

GAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTTTACCGAAAACTTTGCCGCOUUVGCAGGCG 
TCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGCTCGA 

1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT 

51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL 

101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA 

151 FGSDDAGGKL TYT IDF AAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 

201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL 

251 EGGGGTGSAT NDDDVKKAAT VAIAAAYNNG QEINGFKAGE TIYDIDEDGT 

301 ITKKDATAAD VEADDFKGLG LKKWTNLTK TVNENKQNVD AKVKAAESEI 

351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID 

401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKQTAEETKQ 
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451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN 
501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR 
551 LNGLDKTVSD LRKETRQGLA EQAALSGLFQ PYNVGRFNVT AAVGGYKSES 
^ 601 AVAIGTGFRF TENFAAKAGV AVGTSSGSSA AYHVGVNYEW LEHHHHHH* 

AQ741-961C 

ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCT^ 

TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGO 
1 0 AACGGTGACAGCCTCAATACGGGGAAATTGAAGAACGAGAA 

GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTC 

ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAAC^ 

GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATC 

GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAA 
1 5 CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATC 

AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAACT 

AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCC^GCAACTCGAGGGTGGC 

AACGACGACGATGOTAAAAAAGCTGCCACTGTGGCCATTGCT 

AAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAA^ 
20 GCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGT 

GTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACC^ 

GCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGA 

GAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCA 

TTCAACGATATCGCCGATTCATTGGATGAAACC^CACTAAGGCAGACGAA 
25 CAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAAC 

GCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTAC 

GCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTC 

GTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTO 

CACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCG^ 
30 GCCGCGCTCTCCGGTCTGTTCCAACCTTAGAACGTGGGTC 

1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT 

51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL V 

101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA Hi 

35 151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 

201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL V 

251 EGGGGTGSAT NDDDVKKAAT V A I AAA YNNG QEINGFKAGE TIYDIDEDGT 

301 ITKKDATAAD VEADDFKGLG LKKWTNLTK TVNENKQNVD AKVKAAESEI J t 

351 EKLTTKLADT DAALADTDAA LDATTNALNK LGENITTFAE ETKTNIVKID 

40 401 EKLEAVADTV DKHAEAFNDI ADSLDETNTK ADEAVKTANE AKQTAEETKQ 

451 NVDAKVKAAE TAAGKAEAAA GTANTAADKA EAVAAKVTDI KADIATNKDN : 

501 IAKKANSADV YTREESDSKF VRIDGLNATT EKLDTRLASA EKSIADHDTR 

551 LNGLDKTVSD LRKETRQGLA EQAALSGLFQ PYNVGLEHHH HHH* w: 



45 



A0741-983 



ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGM 

TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTC 

AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTC 

50 GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTC 

ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAG 
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGA 
GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCG 
CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGA^ 

55 AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCG 

AAAACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGGGATCCGGCGGAGGCGG 
CCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAAATCAGCAGCAG 
GCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATC 
GATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATC 

60 ATCAACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTC 
GTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAG 

TATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTC 
ACTGLAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGA 
CGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAA 
65 AAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGG 

AGTTTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAACT 
TTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCG 
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TACCACATCCGTAATAAAAACATGCT 

CTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCAOT 

AAACGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTA^^ 

TGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTG^ 

GCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATC 

TTGCTGACGACGGCTCkGGACATCGGTGCAGTC 

GCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATC 

TTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGG^ 

TATACGGGCAAAACC&TTATCGAAGGCGGra^ 

AAAGGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCC 

GACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGC^^ 

TTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGC 

GGCTATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCGCCAAAATCGGGCAGGATT 

AACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGC 

TCCTATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTC 

GCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAACTGGATGCCTCCGAAT 

ACGGTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAA^ 

GCCGTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTT^ 

GCCGCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACCAC^ 

GTCATCGCGCAAACCCAACAGGACGGTGGAACGTGGGAACAGGGCGGTG 

ACCGTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAOT 

GAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTOT 

CTCAAAGGCCTGTTCTCCTACGGACGCTACAAAAAC^ 

AGCGTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGC 

GTCGAAGGCGGTCTGCGCTACGACCTGCTOVAAC^GGATGCATTCGCCG 

AACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTC 

TTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTO 
GCAACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGTTGCCG 
AACGGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAA 
GGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA 

1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT 

51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLESGEF QVYKQSHSAL 

101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA 

151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLAAA DIKPDGKRHA 

201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL 

251 EGSGGGGTSA PDFNAGGTGI GSNSRATTAK SAAVSYAGIK NEMCKDRSML 

301 CAGRDDVAVT DRDAKINAPP PNLHTGDFPN PNDAYKNLIN LKPAIEAGYT 

351 GRGVEVGIVD TGESVGSISF PELYGRKEHG YNENYKNYTA YMRKEAPEDG 

401 GGKDIEASFD DEAVIETEAK PTDIRHVKEI GHIDLVSHII GGRSVDGRPA 

451 GGIAPDATLH IMNTNDETKN EMMVAAIRNA WVKLGERGVR IVNNSFGTTS 

501 RAGTADLFQI ANSEEQYRQA LLDYSGGDKT DEGIRLMQQS DYGNLSYHIR 

551 NKNMLFIFST GNDAQAQPNT YALLPFYEKD AQKGIITVAG VDRSGEKFKR 

601 EMYGEPGTEP LEYGSNHCGI TAMWCLSAPY EASVRFTRTN PIQIAGTSFS 

651 APIVTGTAAL LLQKYPWMSN DNLRTTLLTT AQDIGAVGVD SKFGWGLLDA 

701 GKAMNGPASF PFGDFTADTK GTSDIAYSFR NDISGTGGLI KKGGSQLQLH 

751 GNNTYTGKTI IEGGSLVLYG NNKSDMRVET KGALIYNGAA SGGSLNSDGI 

801 VYIiADTDQSG ANETVHIKGS LQLDGKGTLY TRLGKLLKVD GTAIIGGKLY 

851 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 

901 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

951 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 

1001 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

1051 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 

1101 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 

1151 MQLGAIiGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 

1201 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

1251 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 

1301 GYRFLEHHHH HH* 



AG741-ORF46.1 

ATGGTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGACAAAGGTTTGCAG 
TCTTTGACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGC 
AACGGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTO 
GACGGGCAGCTCATTACCTTGGAGAGTGGAGAGTTC^ 

ACCGAGCAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTT 
GAACATACATCTTTTGACAAGCTTCCCGAAGGCGG 
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GGCGGAAAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAA^ 

CTCAATGTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTC^ 

AACCAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGC^^ 

AAAACCGTAAACGGCATACXXTCATATCGGCCTTGCCGCCAAGCAACTCGACGGTC 

TTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCA 

AGCAGGGGGGAACTTGCCGAGCGGAGCGGCG&TATCGGA 

ATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATT 

TTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTC 

TGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTC 

AGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGAC^ 

CAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTA^ 

CGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAC^ 

ATCGGCGCGGCAGGAGAAATTGTCGGCGGAGGCGATGCCGTGC^ 

CACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGC^ 

TATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCC 

TTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGA 

ATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCG 

TACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTT 

ACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAA 

TTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAAT^ 

1 MVAADIGAGL ADALTAPLDH KDKGLQSLTL DQSVRKNEKL KLAAQGAEKT 

51 YGNGDSLNTG KLKNDKVSRF DFIRQIEVDG QLITLBSGEF QVYKQSHSAL 

101 TAFQTEQIQD SEHSGKMVAK RQFRIGDIAG EHTSFDKLPE GGRATYRGTA 

151 FGSDDAGGKL TYTIDFAAKQ GNGKIEHLKS PELNVDLiAAA DIKPDGKRHA 

201 VISGSVLYNQ AEKGSYSLGI FGGKAQEVAG SAEVKTVNGI RHIGLAAKQL 

251 DGGGGTGSSD LANDSFIRQV LDRQHFEPDG KYHLFGSRGE LAERSGHIGL 

301 GKIQSHQLGN LMIQQAAIKG NIGYIVRFSD HGHEVHSPFD NHASHSDSDE 

351 AGSPVDGFSL YRIHWDGYEH HPADGYDGPQ GGGYPAPKGA RDIYSYDIKG 

401 VAQNIRLNLT DNRSTGQRLA DRFHNAGSML TQGVGDGFKR ATRYSPEUDR 

451 SGNAARAFNG TAD3VKNIIG AAGEIVGAGD AVQGISBGSN IAVMHGIiGLL 

501 STENKMARIN DIiADMAQLKD YAAAAIRDWA VQNPNAAQGI EAVSNIFMAA 

551 IPIKGIGAVR GKYGLGGITA HPIKRSQMGA IALPKGKSAV SDNFADAAYA 

601 KYPSPYHSRN IRSNLEQRYG KENITSSTVP PSNGKNVKLA DQRHPKTGVP . .. 

651 FDGKGFPNFE KHVKYDTLEH HHHHH* T 

Example 5 - hybrids of 287 

Expression of 287 as full-length with a C-terminal His-tag, or without its leader peptide but 
with a C-terminal His-tag, gives fairly low expression levels. Better expression is achieved 
using a N-terminal GST-fusion. As an alternative to using GST as an N-terminal fiision 
partner, 287 was placed at the C-terminus of protein 919 ('919-287'), of protein 953 ('953- 
287'), and of proteins ORF46.1 ('ORF46.1-287'). In both cases, the leader peptides were 
deleted, and the hybrids were direct in-frame fusions. 

To generate the 953-287 hybrid, the leader peptides of the two proteins were omitted by 
designing the forward primer downstream from the leader of each sequence; the stop codon 
sequence was omitted in the 953 reverse primer but included in the 287 reverse primer. For 
the 953 gene, the 5' and the 3' primers used for amplification included a Ndel and a BamHl 
restriction sites respectively, whereas for the amplification of the 287 gene the 5' and the 3' 
primers included a BamHl and a Xhol restriction sites respectively. In this way a sequential 
directional cloning of the two genes in pET21b+, using NdeVBamHL (to clone the first gene) 
and subsequently BamHL-Xhol (to clone the second gene) could be achieved. 
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The 919-287 hybrid was obtained by cloning the sequence coding for the mature portion of 
287 into the Xhol site at the 3'-end of the 919-His clone in pET21b+. The primers used for 
amplification of the 287 gene were designed for introducing a Sail restriction site at the 5'- 
and a Xhol site at the 3'- of the PCR fragment Since the cohesive ends produced by the Sail 
and Xhol restriction enzymes are compatible, the 287 PCR product digested with Satl-Xhol 
could be inserted in the pET21b-919 clone cleaved with Xhol 

The ORF46.1-287 hybrid was obtained similarly. 

The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins 
was compared with antibodies raised against simple mixtures of the component antigens: 





Mixture with 287 


Hybrid with 287 


919 


32000 


16000 


953 


8192 


8192 


ORF46.1 


128 


8192 



Data for bactericidal activity against heterologous MenB strains and against serotypes A and 
C were also obtained for 919-287 and 953-287: 





919 


953 


ORF46.1 


Strain 


Mixture 


Hybrid 


Mixture 


Hybrid 


Mixture 


Hybrid 


MC58 


512 


1024 


512 


1024 




1024 


NGH38 


1024 


2048 


2048 


4096 




4096 


BZ232 


512 


128 


1024 


16 






MenA (F6124) 


512 


2048 


2048 


32 




1024 


MenC(Cll) 


>2048 


n.d. 


>2048 


n.d. 




n.d. 


MenC (BZ133) 


>4096 


>8192 


>4096 


<16 




2048 



Hybrids of ORF46.1 and 919 were also constructed. Best results (four-fold higher titre) were 
achieved with 919 at the N-terminus. 

Hybrids 919-5 19His, ORF97-225His and 225-ORF97His were also tested. These gave 
moderate ELISA fitres and bactericidal antibody responses. 

As hybrids of two proteins A & B may be either NH2-A-B-COOH or NHr-B-A-€OOH, 
the "reverse" hybrids with 287 at the N-terminus were also made, but using AG287. A panel 
of strains was used, including homologous strain 2996. FCA was used as adjuvant: 
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287 & 919 


287 & 953 


287 & ORF46.1 


Strain 


JG287-919 


9J9-287 


AG287-953 


953-287 


AG287-461 


46 1-287 


2996 


128000 


16000 


65536 


8192 


16384 


8192 


BZ232 


256 


128 


128 


<4 


<4 


<4 ! 


1000 


2048 


<4 


<4 


<4 




<*4 


MC58 


8192 


1024 


16384 


1024 


512 


128 


NGH38 


32000 


2048 


>2048 


4096 


16384 


4096 


394/98 


4096 


32 


256 


128 


128 


16 


MenA (F6124) 


32000 


2048 


>2048 


32 


8192 


1024 


MenC (BZ133) 


64000 


>8192 


>8192 


<16 


8192 


2048 



Better bactericidal titres are generally seen with 287 at the N-terminus. 

When fused to protein 961 [NH2-AG287-961-COOH - sequence shown above], the resulting 
protein is insoluble and must be denatured and renatured for purification. Following 
renaturation, around 50% of the protein was found to remain insoluble. The soluble and 
5 insoluble proteins were compared, and much better bactericidal titres were obtained with the 
soluble protein (FCA as adjuvant): 





2996 


BZ232 


MC58 


NGH38 


F6124 


BZ133 


Soluble 


65536 


128 


4096 


>2048 


>2048 


4096 


Insoluble 


8192 


<4 


<4 


16 


n.d. 


n.d. 



Titres with the insoluble form were, however, improved by using alum adjuvant instead: - 1 



Insoluble 


32768 


128 


4096 


>2048 


>2048 


2048 



10 961c was also used in hybrid proteins (see above). As 961 and its domain variants direct 
efficient expression, they are ideally suited as the N-terminal portion of a hybrid protein. 



Example 23 - further hybrids 

Further hybrid proteins of the invention are shown in the drawings and have the sequences 
set out below. These are advantageous when compared to the individual proteins: 

15 ORF46.1-741 

ATGTCAGATTTGGCAAACGATTC 
CTATTCGGCAGCAGGGGGGAACT^ 

AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGC 
CATTCCCCCTTCGACAACCATGCCTCACATTC 
20 CGCATCCATTGGGACGGATACGAAGACCATCCCGKXGACGGCTATC 
AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTC 
AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGAC 
CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGC^ 
AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTG 
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GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAACGATTTC 
CTCAAAGACTATGCCGCAGCAGCCATC 

AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTC 

GCAC^TCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCG^ 

GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTC^ 

GAAAACATCACCTCCTCAACCGTGCCGCCGTGA^ 

GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTC 

GTCGCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCTCGACCAT 

TTGACGCTGGATCAGTCCGTCAGGAAAAACGAGA^^ 

GGTGACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGCC^ 

GGGCAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGT^^ 

GAGCAAATAC^GATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGC 

CATACATCTTTTGACAAGCTTCCCG^^ 

GGAAAACTGACCTACACGkTAGATTTCGCCG^ 

AATGTCGACCTGGC CGCCGCCGATATCAAGCCGGATGGAAAACGC CATGCCGTCATC AGCGGTTC 

CAAGCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGCGGAA^GCC 

ACCGTAAACGGCATACGCCATATCGGCCTTGCCGCCAA 

1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ 

51 LGNLMXQQAA IKGNIGYIVR PSDHGHEVHS PFDNHASHSD SDEAGSPVDG 

101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL 

151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAABA 

201 FNGTADIVKN IIGAAGEIVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA 

251 RINDLADMAQ LKDYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG 

301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH 

351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 

401 NFEKHVKYDT GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR 

451 KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL 

501 ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF 

551 DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV 

601 DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK 

651 TVNGIRHIGL AAKQLEHHHH HH* 



ORF46. 1-961 

ATGTCAGATTTGGCAAACGATTCTTTTATCCGGC^ 

CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATC 

AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTA 

CATTCCCCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTT^ 

CGCATCCATTGGGACGGATACGAACACCATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCC 
AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGC 
AGCACCGGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAG 

CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCG 

AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGG 

GCTGTCATGCACGGCTTCGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATC 

CTCAAAGACTATGCCGC?^GCAGCCATCCGCGATTGGGCAGTCCAAAAC 

AGCAATATCTTOATGGCAGCCATCCCCATCAAAGGGATTGGAGCT 

GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCG^ 

GATGCGGCATACGCCAAATACCCGTCCCCT^ 

GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACOT 

GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATT 

GCCACAAACGACGACGATGTTAAAAAAGCTGCCAC^ 

GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTA 
GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAA 
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAA^ 
GCTTTAGCAGATACTGATGCCGCTCTGGATGCAA 

GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGA 

GAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGG 

GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCA^^ 

GAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGT^ 

GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTC 

AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATO 

GCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGAC^ 

GAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTC 

AAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCOT 

GGCACTTCGTCCGGTTCTTCCGXIAGCCTAC^ 

TGA 
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1 MSDLANDSFI RQVLDRQHFE 

51 LGNLMIQQAA IKGNIGYIVR 

101 FSLYRIHWDG YEHHPADGYD 

151 NLTDNRSTGQ RLADRFHNAG 

201 FNGTADIVKN IIGAAGEIVG 

251 RINDLADMAQ LKDYAAAAIR 

301 AVRGKYGLGG ITAHPIKRSQ 

351 SRNIRSNLEQ RYGKENITSS 

401 NFEKHVKYDT GSGGGGATND 

451 YDIDEDGTIT KKDATAADVE 

501 VKAAJESEIEK LTTKLADTDA 

551 KTNIVKIDEK LEAVADTVDK 

601 QTAEETKQNV DAKVKAAETA 

651 DIATNKDNIA KKANSADVYT 

701 SIADHDTRLN GLDKTVSDLR 

751 VGGYKSESAV AIGTGPRFTE 

801 HHHHHH* 



PDGKYHLFGS RGELAERSGH IGLGKIQSHQ 
FSDHGHEVHS PFDNHASHSD SDEAGSPVDG 
GPQGGGYPAP KGARDIYSYD IKGVAQNIRL 
SMLTQGVGDG FKRATRYSPE LDRSGNAAEA 
AGDAVQGISE GSNIAVMHGL GLLSTENKMA 
DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG 
MGAIALPKGK SAVSDNFADA AYAKYPSPYH 
TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 
DDVKKAATVA IAAAYNNGQE INGFKAGETI 
ADDFKGLGLK KWTNLTKTV NENKQNVDAK 
ALADTDAALD ATTNALNKLG ENITTFAEET 
HAEAFNDIAD SLDETNTKAD EAVKTANEAK 
AGKAEAAAGT ANTAADKAEA VAAKVTDHCA 
REESDSKFVR IDGLNATTEK LDTRLASAEK 
KETRQGLAEQ AALSGLFQPY NVGRFNVTAA 
NFAAKAGVAV GTSSGSSAAY HVGVNYEWLE 



ORF46.1-961c 

ATGTCAGATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTCGACCGTCAGC^ 
CTATTCGGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATC 
AACCTGATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTC 
CATTCCCCCTTCGACAACCATGCCTCACATTC^^ 

CGCATCCATTGGGACGGATACGAACACCATCCCCXrCGACGGCTATGACGGGCCACAG 

AAAGGCGCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGAC^ 

AGCACOSGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGC 

CGCGCCACCCGATACAGCCCCGAGCTGGACAGATCGGGCA^ 

AAAAACATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAA 
GCTGTCATGCACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGC^ 
CTCAAAGACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTC 
AGCAATATCTTTATGGCAGCCATCCCCATCAAAGGGATT^ 

GCACATCCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATC 

GATGCGGCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCA^ 

GAAAACATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTG^ 

GGCGTACCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTC 

GCCACAAACGACGACC^TGraAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCC 

GGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACG^ 

GTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGTCAATO 
CAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGCAGACACTGATG 
GCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCOTGA^ 
GCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTG^ 

G AAGCATTCAACGATATCGC CGATTCATTGGATGAAACC AACAC TAAGGCAGACGAAGCCGTC AAAACCGCCAATGAA 

GCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACT 

GAAGCTGCCGCTGGGA(^GCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGT 

GATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACAGC 
AAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTO 

GCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCCAAGGCCTTGCA 
GAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAAC^ 

1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ 

51 LGNLMIQQAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG 

101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL 

151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA 

201 FNGTADIVKN IIGAAGEIVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA 

251 RINDLADMAQ LKDYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG 

301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH 

351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 

401 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGQE INGFKAGETI 

451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KWTNLTKTV NENKQNVDAK 

501 VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET 

551 KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD EAVKTANEAK 

601 QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA VAAKVTDIKA 

651 DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK LDTRLASAEK 

701 SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY NVGLEHHHHH 

751 H* 
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961-ORF46.1 

ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTC 
5 AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGC^ 
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTC^ 
AAAGAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAA 
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACC^ 

TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCT 
10 GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGG 

GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAA 

GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCG^ 

GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAG 

AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACC^ 
1 5 ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAA 

GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTCAATGT 

TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCTO 

GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATGTCGGCGTCAATTACGAGTGGGGATCCGG 

GATTTGGCAAACGATTCTTTTATCCGGCAGGTTCTC 
20 GGCAGCAGGGGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAA 

ATGATTCAACAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGAT 

CCCTTCGACAACCATGCCTCACATTCCGATTCTGATGAA^ 

CATTGGGACGGATACGAACACCATCCCGCCGACGGCTAT 

GCGAGGGATATATACAGCTACGACATAAAAGGCGTTGCCCAAAA 
25 GGACAACGGCTTGCCGACCGTTTCCACAATGCCGGTAGTATGCTGACGCAAGGACT 

ACCCGATACAGCCCCGAGCTGGACAGATCGGGCAATGCCGC^ 

ATCATCGGCGCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAG 

ATGGACGGCTTGGGTCTGCTTTCCACCGAAAACAAGATGGC 

GACTATGCCGCAGCAGCCATCCGCGATTGGGCAGTCCAAA^ 
30 ATCTTTATGGCAGCCATCCCCATCAAAGGGATTGGAGCTGTTC 

CCTATCAAGCGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATC 

GCATACGCCAAATACCCGTCCCCTTACCATTCCCGAAATATCCGTTC^ 

ATCACCTCCTCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTCGCA 

CCGTTTGACGGTAAAGGGTTTCCGAATTTTGAGAAGCACGTGAAATATGA 
35 TGA 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 
51 AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 

40 151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 

201 AAETAAGKAE AAAGTAOTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 
251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 
301 VSDLRKETRQ GIiAEQAALSG LFQPYNVGRF NVTAAVGGYK SESAVAIGTG 
351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGS DLANDSFIRQ 

45 401 VLDRQHFEPD GKYHLFGSRG ELAERSGHIG LGKIQSHQLG NLMIQQAAIK 

451 GNIGYIVRFS DHGHEVHSPF DNHASHSDSD EAGSPVDGFS LYRIHWDGYE 
501 HHPADGYDGP QGGGYPAPKG ARDIYSYDIK GVAQNIRLNL TDNRSTGQRL 
551 ADRFHNAGSM LTQGVGDGFK RATRYSPELD RSGNAAEAFN GTADIVKNII 
601 GAAGE IVG AG DAVQGISEGS NIAVMHGLGL LSTENKMARI NDLADMAQLK 

50 651 DYAAAAIRDW AVQNPNAAQG IEAVSNIFMA AIPIKGIGAV RGKYGLGGIT 

701 AHPIKRSQMG AIALPKGKSA VSDNFADAAY AKYPSPYHSR NIRSNLEQRY 
751 GKENITSSTV PPSNGKNVKL ADQRHPKTGV PFDGKGFPNF EKHVKYDTLE 
801 HHHHHH* 

55 961-741 

ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAACAATGGCC^ 
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAG 
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACT 
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAAT^ 
60 GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCA 
TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATC 
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAAC^ 

GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCGAAAGTAAAAGCTG 
GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGAGAAGGCCGAAGCTGTCGCTG 
65 GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAA 
AGCAAATTTGTCAGAATTGATGGTCTGAACGCT 
ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTC 
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GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGG 

TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCCGCT^ 

GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATC^ 

GCCGCCGACATCGGTGCGGGGCTTGCCGATGCACTAA^^ 

ACGCTGGATCAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCAC^ 

GACAGCCTCAATACGGGCAAATTGAAGAACGACAAGGTCAGC^ 

CAGCTCATTACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTC 

CAAATACAAGATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGOT 

ACATCTTTTGACAAGCTTCCCGAAGGCGGCAGGGCGACATATC 

AAACTGACCTACACCATAGATTTCGCCGCCAAGCAGGGAAAC^ 

GTCGACCTGGCCGCCGCCGATATCAAGCCGGATGGAAAACGCCMX3CC 

GCCGAGAAAGGCAGTTACTCCCTCGGTATCTTTGGC^ 

GTAAACGGCATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCAC 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 

51 AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 

101 ADTDAALADT DAALDATTNA LNKLGENITT PAEETKTNIV KIDEKLEAVA 

151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 

201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREBSD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 

301 VSDLRKETRQ GLAEQAALSG IiFQPYNVGRF NVTAAVGGYK SESAVAIGTG 

351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGV AADIGAGLAD 

401 ALTAPLDHKD KGLQSLTLDQ SVRKNEKLKL AAQGAEKTYG NGDSLNTGKL 

451 KNDKVSRFDF IRQIEVDGQL ITLESGEFQV YKQSHSALTA FQTEQIQDSE 

501 HSGKMVAKRQ FRIGDIAGEH TSFDKLPEGG RATYRGTAFG SDDAGGKLTY 

551 TIDFAAKQGN GKIEHLKSPE LNVDLAAADI KPDGKRHAVI SGSVLYNQAE 

601 KGSYSLGIFG GKAQEVAGSA EVKTVNGIRH IGLAAKQLEH HHHHH* 



961-983 

ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGG 

AACGGTTTGAAAGCTGGAGAGACCATCTACGACATTGATGAAGACC^ 

GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACC 

AAACAAAACGTCGATGCCAAAGTAAAAGCTGGAGAATCTGAAATAGAAA^ 

GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAAT 

TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCC 

GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGC^ 

GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCC^^ 

GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTACCGACAT^ 

GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACG 

AGCAAATTTGTC^GAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCT 

ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGA 

GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTCGGTTC 

TACAAATCCGAATCGGCAGTCGCCATCGGTACCGGCTTCC 

GTCGGCACTTCGTCCGGTTCTTCCGCAGCCTACCATG^ 

TCTGCGCCCGACTTCAATGCAGGCGGTACCGGTATCGGCAGC 

TCTTACGCCGGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTCGGGATGACGTO 

GACAGGGATGCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATACAAG 

AATTTGATCAACCTCAAACCTGCAATTGAAGCAGG^ 

GAATCCGTCGGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAATTACAAAAACTAT 

ACGGCGTATATGCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCT 

ATAGAGACTGAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGT 

GGCGGGCGTTCCGTGGACGGCAGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAA 

GAAACCAAGAACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAA 

AATAACAGTTTTGXSAACAACATCGAGGGCAGGCACTGCCGACCTTT^ 

CAAGCGTTGCTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGA 

CTGTCCTACCACATCCGTAATAAAAACATGCTTTTCATC 

TATGCCCTATTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATTATCACAG 

AAGTTC AAACGGGAAATGTATGGAGAAC CGGGTACAGAAC CGCTTGAGTATGGC TCCAACCATTGCGGAATTACTGC C 

ATGTGGTGCCTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCG 

TTTTCCGCACCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACC 

ACCACGTTGCTGACGACGGCTCAGGACATCGGTGCAGTCGGCGTGGACAGCAAGT^ 

GGTAAGGCCATGAACGGACCCGCGTCCTTTCCGTTCGGCGACCT 

TACTCCTTCCGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGC^ 

AACACCTATACGGGCAAAACCATTATCGAAGGCGGTTCGCTGG 

GAAACCAAAGGTGCGCTCATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGG^ 
GATACCGACCAATCCGGCGCAAACGAAACCGTACACATCAAAGGCAG 
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ACACGTTTGGGCAAACTGCTGAAAGTGGACGGTACGGCGATTA 
GGGGCAGGCTATCTCAACAGTACCGGACGACGTGTTCCCTTC 
TTCACAAACATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGC 
ACGCTGTCCTATTATGTCCGTCGCGGCAATGCGGCA^ 
5 AAACACGCCGTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGAAOT 
CCCGAGACGGTTCAAACTGCGGCAGCCGACCGCACAGATATGCCG 

GCGGCAGCCGTACAGCATGCGAATGCCGC CGACGGTGTACGCATCTTC AACAGTCTCGCCGCT AC CGTCTATGCCGAC 
AGTAC CGCCGC C CATGC CGATATGCAGGGACGCCGCCTGAAAGCCGT ATCGGACGGGTT^ 
CTGCGCGTCATCGCGCAAACCCAACAGGACGGTGGAACGTG 
10 ACCCAAACCGTCGGCATTGCCGCGAAAACCGGCGAAAATATO 

TGGAGCGAAAACAGTGCAAATGCAAAAACCGACAGCATTAGTCrc 
GGCTATCTCAAAGGCCTGTTCTCCTACGGACGCTAO^ 

GAAGGCAGCGTCAACGGCACGCTGATGCAGCTGGGCGGACTGGGCGGTGTC^ 
TTGACGGTCGAAGGCGGTCTGCGCTACGACCTGCTCAAAGA^ 
1 5 AGCGGCAACAGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTC 
GTCCTGTTTGCAACGGCGGGCGTGGAACGCGACCTGAACGGACGC 
ACTGCAGCAACCGGCAAGACGGGGGCACGCAATATGCCGCACA 
TTCGGCAACGGCTGGAACGGCTTGGCACGTTACAGCTACGC^ 
GGCGTAGGCTACCGGTTCCTCGAGCACCACCACCACCACCACTGA 

20 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 
51 AADVEADDFK GLGLKKWTN LTKTVNBNKQ NVDAKVKAAE SEIEKLTTKL 
101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 
151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 
25 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 
301 VSDLRKETRQ GLAEQAALSG LFQPYNVGRF NVTAAVGGYX SESAVAIGTG 
351 FRFTENFAAK AGVAVGTSSG SSAAYHVGVN YEWGSGGGGT SAPDFNAGGT 
401 GIGSNSRATT AKSAAVSYAG IKNEMCKDRS MLCAGRDDVA VTDRDAKINA 
30 451 PPPNLHTGDF PNPNDAYKNL INLKPAIEAG YTGRGVEVGI VDTGESVGSI 

501 SFPELYGRKE HGYNENYKNY TAYMRKEAPE DGGGKDIEAS FDDEAVIETE 
551 AKPTDIRHVK EIGHIDLVSH IIGGRSVDGR PAGGIAPDAT LHD4NTNDET 
601 KNEMMVAAIR NAWVKLGERG VRTVNNSFGT TSRAGTADLF QIANSBEQYR 
651 QALLDYSGGD KTDEGIRLMQ QSDYGNLSYH IRNKNMLFIF STGNDAQAQP 
35 701 NTYALLPFYE KDAQKGIITV AGVDRSGEKF KREMYGEPGT EPLEYGSNHC 

751 GITAMWCLSA PYEASVRFTR TNPIQIAGTS FSAPIVTGTA ALLLQKYPWM 
801 SNDNLRTTLL TTAQDIGAVG VDSKFGWGLL DAGKAMNGPA SFPFGDFTAD 
851 TKGTSDIAYS FRNDISGTGG LIKKGGSQLQ LHGNNTYTGK TIIEGGSLVL 
901 YGNNKSDMRV ETKGALIYNG AASGGSLNSD GIVYLADTDQ SGANETVHIK 
40 951 GSLQLDGKGT LYTRLGKLLK VDGTAIIGGK LYMSARGKGA GYLNSTGRRV 

1001 PFLSAAKIGQ DYSFFTNIET DGGLLASLDS VEKTAGSEGD TLSYYVRRGN 
1051 AARTASAAAH SAPAGLKHAV EQGGSNLENL MVELDASESS ATPETVETAA 
1101 ADRTDMPGIR PYGATFRAAA AVQHANAADG VRIFNSLAAT VYADSTAAHA 
1151 DMQGRRLKAV SDGLDHNGTG LRVIAQTQQD GGTWEQGGVE GKMRGSTQTV 
45 1201 GIAAKTGENT TAAATLGMGR STWSENSANA KTDSISLFAG IRHDAGDIGY 

1251 LKGLFSYGRY KNSISRSTGA DEHAEGSVNG TLMQLGALGG VNVPFAATGD 
1301 IjTVEGGLRYD LLKQDAFAEK GSALGWSGNS LTEGTLVGLA GLKLSQPIiSD 
1351 KAVLFATAGV ERDLNGRDYT VTGGFTGATA ATGKTGARNM PHTRLVAGLG 
1401 ADVEFGNGWN GLARYSYAGS KQYGNHSGRV GVGYRFLEHH HHHH* 

50 

961C-ORF46.1 

ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGC^ 
AACGGTTTCAAAGCTGGAGAGACCATCTACGACATO 
55 GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAA^ 
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCT 
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAA 

TTTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATAC 

GCCGAAGCATTCAACGATATCGCCGATTCATTGGATGAAACCAACACT 
60 GAAGCCAAACAGACGGCCGAAGAAACO^CAAAACGTCGATGCCAAAGTAAAAGCTGCA 

GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGA^ 

GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAAC^ 

AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTG^ 

ATTGCCGATCACGATACTCGCCTGAACGGTTT^ 
65 GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGGAGGAGGAGGATC 

AACGATTCTTTTATCCGGCAGGTTCTCGACCGTGAGCATTTCGAACCCGAC 
GGGGAACTTGCCGAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGCCATCAGTT 
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CAGGCGGCCATTAAAGGAAATATCGGCTACATTGTCCGCTTTTCCGA 
AACCATGCCTCACATTCCGATTCTGATGAAGCCGGTAGTCCCGTTG^ 
C^TACGAACACCATCCCGCCGACGGCTATGACGG^ 

ATATACAGCTACGAC^TAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAACCGCAGCAC 
CTTGCCGACCGTTTC CACAATGC CGGTAGTATGCTGACGCAAGGAGTAGGCGACGGATTCAAACGCGCC ACCCGAT AC 
AGCCCCGAGCTGGACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATC 
GCGGCAGGAGAAATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGC 
TTGGGTCTGCTTTCCACCGAAAACAAGATGGCGCGCATCAAC 

GCAGCAGCCATCCGCGATTGGGCAGTCCAAAACCCCAATGCCGCACAAGGCATAGAAGCCGTC^ 
1U GCAGCCATCCCCATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGC 

CGGTCGCAGATGGGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAA 
AAATACCCGTCCCCTTACCATTCCCGAAATATCCGOT^ 

TCAACCGTGCCGCCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCG 
GGTAAAGGGTOTCCGAATTTTGAGAAGCACGTGAAATATGA^ 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 

51 AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 

101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 

151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 

20 201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 

301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGSDLAND SFIRQVLDRQ 

351 HFEPDGKYHL FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY 

401 IVRFSDHGHE VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD 

25 451 GYDGPQGGGY PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH 

501 NAGSMLTQGV GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE 

551 IVGAGDAVQG ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA 

601 AIRDWAVQNP NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK 

651 RSQMGAIALP KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI 

30 701 TSSTVPPSNG KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTLEHHHHH 

751 H* 

961c-741 

35 ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCA 

AACGGTTTCAAAGCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAA^ 
GATGTTGAAGCCGACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACC^ 
AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAAC^ 
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTT^ 

40 tttgctgaagagactaagacaaatatcgtaaaaattgatgaaaaat^^ 

gccgaagc^ttcaacgatatcgccgattcattggatgaaaccaacactaaggcagacgaagccg 

gaagccaaacagacggccgaagaaaccaaacaaaacgtcgatgcca^gtaaaa 

gccgaagctgccgctggcacagctaatactgcagc^ 

GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGC^ 
45 AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGG^ 

ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAA^ 

GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTAC^ 

ATCGGTGCGGGGCTTGCCGATGCACTAACCGCACCGCT 

CAGTCCGTCAGGAAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTO 
50 AATACGGGCAAATTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATCGAAGTGGACGGG 
ACCTTGGAGAGTGGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAACCGCCTTTCAGACCGA 
GATTCGGAGCATTCCGGGAAGATGGTTGCGAAACGCCAGTTCAGA 

GACAAGCTTCCCGAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCG 
TACACCATAGATTTCGCCGCCAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCAG 
55 GCCGCCGCCGATATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTCCGTCC 
GGCAGTTACTCCCTCGGTATCTTTGGCGGAAAAGCCCAGGAAG 
ATACGCCATATCGGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 

60 51 AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 

101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 

151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 

201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 

65 301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGVAADIG AGLADALTAP 

351 LDHKDKGLQS LTLDQSVRKN EKLKLAAQGA EKTYGNGDSL NTGKLKNDKV 

401 SRFDFIRQIE VDGQLITLES GEFQVYKQSH SALTAFQTEQ IQDSEHSGKM 
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451 
501 
551 



VAKRQFRIGD IAGEHTSFDK LPEGGRATYR GTAFGSDDAG GKLTYTIDFA 
AKQGNGKIEH LKSPELNVDL AAADIKPDGK RHAVISGSVL YNQAEKGSYS 
LGIFGGKAQE VAGSAEVKTV NGIRHIGLAA KQLEHHHHHH * 
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ATGGCCACAAACGACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCT 

AACGGTTTCAAAGCTGGAGAGACCATCTACGAGATTGATG 

GATGTOGAAGCCGACGACTTTAAAGGTCTGG^ 

AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAAC^CCAAGTT 
GCCGCTTTAGCAGATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAA 
TOTGCTGAAGAGACTAAGACAAATATCGTAAAAATTGATGAAAAA 
GCCGAAGCATTCAACGATATCGCCGATTCATTGGATC^ 

GAAGCCAAACAGACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGC^ 

GCCGAAGCTGCCGCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGOT 

GCTGATATCGCTACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAA^ 

AGCAAATTTGTCAGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACA 

ATTGCCGATCACGATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCG 

GCAGAACAAGCCGCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCCGG^ 

GACTTCAATGCAGGCGGTACCGGTATCGGCAGCAACAGCAGAGCAAG^ 

GGTATCAAGAACGAAATGTGCAAAGACAGAAGCATGCTCTG 

GCCAAAATCAATGCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAAACCCAAATGACGCATAC^^ 

AACCTCAAACCTGCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGT 

GGCAGCATATCCTTTCCCGAACTGTATGGCAGAAAAGAACACGOT 

ATCCGGAAGGAAGCGCCTGAAGACGGAGGCGGTAAAGACATTGAAGCTTCTTTC^ 

GAAGCAAAGCCGACGGATATCCGCCACGTAAAAGAAATCGGACACA^ 

TCCGTGGACGGCAGACCTGCAGGCGGTATTGC 

AACGAAATGATGGTTGCAGCCATCCGCAATGCATGGGTCAAGCTGGGCGA 
TTTGGAACAACATCGAGGGCAGGCACTGCCGACCTTTTCC^ 

CTCGACTATTCCGGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCGATTA 

CACATCCGTAATAAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCA 

TTGCCATTTTATGAAAAAGACGCTCAAAAAGGCATT^^ 

CGGGAAATGTATGGAGAACCGGGTACAGAACCGCTTGAGTATGGCTC 

CTGTCGGCACCCTATGAAGCAAGCGTCCGTTTCAC^ 

CCCATCGTAACCGGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGACAACCTC 
CTGACGACGGCTCAGGACATCGGTGCAGTCGGC^^ 

ATGAACGGACCCGCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGC 

CGTAACGACATTTCAGGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGC^ 

ACGGGCAAAACCATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAACAAATCG^ 

GGTGCGCTGATTTATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCTATC 

CAATCCGGCGCAAACGAAACCGTACACATGAAAGGCAGTC^ 

GGCAAACTGCTGAAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCGGCACGCGGCAA 

TATCTCAACAGTACCGGACGACGTGTTCCCTTCCTGAGTGCCG^ 

ATCGAAACCGACGGCGGCCTGCTGGCTTCCCTCGACAGC^^ 

TATTATGTCCGTCGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTC 
GTAGAACAGGGCGGCAGCAATCTGGAAAACCTGATGGTCGSAACTGGATGCCTC 

GTTGAAACTGCGGCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGCAGCGGCAGCC 
GTACAGCATGCGAATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGCTACCGTCTATGCCGACAGTACCGCC 
GCCCATGCCGATATGCAGGGACGCCGCCTGAAAGCC^ 

ATCGCGCAAACC CAACAGGACGGTGGAACGTGGGAACAGGGCGG^ C 

GTCGGCATTGCCGCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCA 

AACAGTGCAAATGCAAAAACCGACAGCATTAGTCTGTTTC^^ 

AAAGGCCTGTTCTCCTACGGACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGC 

GTCAACGGCACGCTGATGCAGCTGGGCGCACTGGGCGGTGTC 

GAAGGCGGTCTGCGCTACGACCTGCTCAAACAGGATGCATTC 

AGCCTCACTGAAGGCACGCTGGTCGGACTCGCGGGTCTGAAGCT 

GCAACGGCGGGCGTGGAACGCGACCTGAACGGACGCGACTACACGGTA^ 

ACCGGCAAGACGGGGGCACGCAATATGCCGCACACCCGTCTGGOT 

GGCTGGAACGGCTTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGT 
TACCGGTTCCTCGAGCACCACCACCACCACCACTGA 

1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETIYDIDE DGTITKKDAT 

51 AADVEADDFK GLGLKKWTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 

101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 

151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEB TKQNVDAKVK 

201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 
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301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGTSAPDF NAGGTGIGSN 

351 SRATTAKSAA VSYAGIKNKM CKDRSMLCAG RDDVAVTDRD AKINAPPPNL 

401 HTGDFPNPND AYKNLINLKP AIEAGYTGRG VEVGIVOTGE SVGSISFPEL 

451 YGRKEHGYNE NYKNYTAYMR KEAPEDGGGK DIEASFDDEA VIETEAKPTD 

501 IRHVKEIGHI DLVSHIIGGR SVDGRPAGGI APDATLHIMN TNDETKNEMM 

551 VAAIRNAWVK LGERGVRIVN NSFGTTSRAG TADLFQIANS EEQYRQALLD 

601 YSGGDKTDEG IRLMQQSDYG NLSYHIRNKN MLFIFSTGND AQAQPNTYAL 

651 LPFYEKDAQK GIITVAGVDR SGEKFKREMY GEPGTEPLEY GSNHCGITAM 

701 WCLSAPYEAS VRFTRTNPIQ IAGTSFSAPI VTGTAALLLQ KYPWMSNDNL 

751 RTTLLTTAQD IGAVGVDSKF GWGLLDAGKA MNGPASFPFG DFTADTKGTS 

801 DIAYSFRNDI SGTGGLIKKG GSQLQLHGNN TYTGKTIIEG GSLVLYGNNK 

851 SDMRVETKGA LIYNGAASGG SLNSDGIVYL ADTDQSGANE TVHIKGSLQL 

901 DGKGTLYTRL GKLLKVDGTA IIGGKLYMSA RGKGAGYLNS TGRRVPFLSA 

951 AKIGQDYSFF TNIETDGGLL ASLDSVEKTA GSEGDTLSYY VRRGNAARTA 

1001 SAAAHSAPAG LKHAVEQGGS NLENLMVELD ASESSATPET VETAAADRTD 

1051 MPGIRPYGAT FRAAAAVQHA NAADGVRIFN SLAATVYADS TAAHADMQGR f 

1101 RLKAVSDGLD HNGTGLRVIA QTQQDGGTWE QGGVEGKMRG STQTVGIAAK i 

1151 TGENTTAAAT LGMGRSTWSE NSANAKTDSI SLFAGIRHDA GDIGYLKGLF 

1201 SYGRYKNSIS RSTGADEHAE GSVNGTLMQL GALGGVNVPF AATGDLTVEG 

1251 GLRYDLIaKQD AFAEKGSALG WSGNSLTEGT LVGLAGLKLS QPLSDKAVLF 

1301 ATAGVERDLN GRDYTVTGGF TGATAATGKT GARNMPHTRL VAGLGADVEF 

1351 GNGWNGLARY SYAGSKQYGN HSGRVGVGYR FLEHHHHHH* 



961CL-ORF46.1 

ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATCCTTGCCACTTTC 
GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCC^ 

GCTGGAGAGACCATC TACGAC ATTGATGAAGACGGC ACAATTACGAAAAAAGACGCAAC^ C 
GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAAC^ 
GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTA 
GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTT 

ACTAAGACAAATATC GTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATAC CGTCGACAAGCATGCCGAAGCATTC 
AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGC CGTCAAAACCGCCAATGAAGC C AAACAG 
ACGGCCGAAGAAACC AAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGC AGCAGGC^ C 
GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGC 

ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTGTACACCAGAGAAGAGTCTGACA 

AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTC 

GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAACCCGCC^ 

GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATCC 

ATCCGGCAGGTTCTCGACCGTGAGCATTTCGAACCCGACGGGAAATAC^ 

GAGCGCAGCGGCCATATCGGATTGGGAAAAATACAAAGC CATCAGTTGGGC AACCTGATGATTCAACAGGCGGC C ATT 

AAAGGAAATATCGGCTACATTGTCCGCTTTTCCGATCACGGGCACGAAGTCCATTCCCCC 

CATTCCGATTCTGATGAAGCCGGTAGTCCCGTTGACGGAT^ 

CATCCCGCCGACGGCTATGACGGGCCACAGGGCGGCGGCTATCCCGCTCCCAAAGGCGCGAG 

GACATAAAAGGCGTTGCCCAAAATATCCGCCTCAACCTGACCGACAAC^ 

TTCCACAATGCCGGTAGTATGCTGACGCAAGGAGTAGGCGACGGACT 

GACAGATCGGGCAATGCCGCCGAAGCCTTCAACGGCACTGCAGATATCGTTAAAAACATCATCGGCGCGGCAGGAGAA 
ATTGTCGGCGCAGGCGATGCCGTGCAGGGCATAAGCGAAGGCTCAAACAT^ 

TCCACCGAAAACAAGATGGCGCGCATCAACGATTTGGCAGATATGGCGCAACTCAAAGACTATGCCGCAGCAGCCA 

CGCGATTGGGCAGTC CAAAACCCCAATGCCGCACAAGGCAT AGAAGCCGTC AGCAATATCTTTATGGCAGC C ATC CC C 

ATCAAAGGGATTGGAGCTGTTCGGGGAAAATACGGCTTGGGCGGCATC 

GGCGCGATCGCATTGCCGAAAGGGAAATCCGCCGTCAGCGACAATT^ 

CCTTACCATTCCCGAAATATCCGTTCAAACTTGGAGCAGCGTTACGGCAAAGA 

CCGTCAAACGGCAAAAATGTCAAACTGGCAGACCAACGCCACCCGAAGACAGGCGTACCGTTTGA 

CCGAATTTTGAGAAGCACGTGAAATATGATACGTAACTCGAG 

1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING 

51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN 

101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 

151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV 

201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 

251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 

301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQPYNVG 

351 GSGGGGSDLA NDSFIRQVLD RQHFEPDGKY HLFGSRGELA ERSGHIGLGK 

401 IQSHQLGNLM IQQAAIKGNI GYXVRFSDHG HEVHSPFDNH ASHSDSDEAG 

451 SPVDGFSLYR IHWDGYEHHP ADGYDGPQGG GYPAPKGARD IYSYDIKGVA 

501 QNIRLNLTDN RSTGQRLADR FHNAGSMLTQ GVGDGFKRAT RYSPELDRSG 
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551 NAAEAFNGTA DIVKNIIGAA GEIVGAGDAV QGISEGSNIA VMHGLGLLST 

601 ENKMARINDL ADMAQLKDYA AAAIRDWAVQ NPNAAQGIEA VSNIFMAAIP 

651 IKGIGAVRGK YGLGGITAHP IKRSQMGAIA LPKGKSAVSD NFADAAYAKY 

701 PSPYHSRNIR SNLEQRYGKE NITSSTVPPS NGKNVKLADQ RHPKTGVPFD 

5 751 GKGFPNFEKH VKYDT* 

961CL-741 

ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATC^ 

1 0 GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGC CTAC AACAATGGC C^GAAATCAACGGTTTCAAA 

GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGACGCA^ 
GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACCTGACCAAAACCGT^ 
GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTTAACAACCAAGTTAGC^ 
GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTO 

1 5 ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCG 
AACGATATCGCCGATTCATTGGATGAAACCAACACT 

ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGCTGCAGAAACTGC^ 

GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGA^ 

ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCG 

20 

AGAATTGATGGTCTGAACGCTACTACCGAAAAATTGGACACACGCTTGGCTTCTGCTGAAAAATCC^ 

GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGCGCAAAGAAAC 

GCGCTCTCCGGTCTGTTCCAACCTTACAA 

CTTGCCGATGCACTAACCGCACCGCTCGACCATAAAGAC 

AAAAACGAGAAACTGAAGCTGGCGGCACAAGGTGCGGAAAAAACTTATGGA 
25 TTGAAGAACGACAAGGTCAGCCGTTTCGACTTTATCCGCCAAATC 

GGAGAGTTCCAAGTATACAAACAAAGCCATTCCGCCTTAAC CGC CTTTCAGACCGAGCAAATACAAGATTCGGAGC AT 

TCCGGGAAGATGGTTGCGAAACGCCAGTTCAGAATCGGCGACATAGCGGGCGAACATACA 

GAAGGCGGCAGGGCGACATATCGCGGGACGGCGTTCGGTTCAGACGATGCCGGCGGAAAAC 

TTCGCCGCGAAGCAGGGAAACGGCAAAATCGAACATTTGAAATCGCCA 
30 ATCAAGCCGGATGGAAAACGCCATGCCGTCATCAGCGGTTC^ 

CTCGGTATCTTTGGCGGAAAAGCCCAGGAAGTTGCCGGCAGCGCGGAAGTGAAAACCGTA 

GGCCTTGCCGCCAAGCAACTCGAGCACCACCACCACCACCACTGA 

1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING 

35 51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKW TNLTKTVNEN 

101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 

151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV 

201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 

251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 

40 301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQPYNVG 

351 GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ 

401 GAEKTYGNGD SLNTGKLKKD KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ 

451 SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT 

501 YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD 

45 551 GKRHAVTSGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL 

601 AAKQLEHHHH HH* 

961cL-983 

50 ATGAAACACTTTCCATCCAAAGTACTGACCACAGCCATC CTTGCCACTTTCTGTAGCGGCGCACTGGCAGC CACAAAC 

GACGACGATGTTAAAAAAGCTGCCACTGTGGCCATTGCTGCTGCCTACAAC 

GCTGGAGAGACCATCTACGACATTGATGAAGACGGCACAATTACCAAAAAAGA 

GACGACTTTAAAGGTCTGGGTCTGAAAAAAGTCGTGACTAACC 

GATGCCAAAGTAAAAGCTGCAGAATCTGAAATAGAAAAGTT 
55 GATACTGATGCCGCTCTGGATGCAACCACCAACGCCTTGAATAAATTGGGAGAAA^ 

ACTAAGACAAATATCGTAAAAATTGATGAAAAATTAGAAGCCGTGGCTGATACCGTCGACAAGCATGC 

AACGATATCGCCGATTCATTGGATGAAACCAACACTAAGGCAGACGAAGCCGTCAAAACCGCCAATC 

ACGGCCGAAGAAACCAAACAAAACGTCGATGCCAAAGTAAAAGC^ 

GCTGGCACAGCTAATACTGCAGCCGACAAGGCCGAAGCTGTCGCTGCAAAAGTTA 
60 ACGAACAAAGATAATATTGCTAAAAAAGCAAACAGTGCCGACGTG 

AGAATTGATGGTCTGAACGCTACTACCGAAAAATTCGACACAC 

GATACTCGCCTGAACGGTTTGGATAAAACAGTGTCAGACCTGC^ 

GCGCTCTCCGGTCTGTTCCAACCTTACAACGTGGGTGGATC 

GGCGGTACCGGTATCGGCAGCAACAGCAGAGCAACAACAGCGAA^^ 
65 GAAATGTGCAAAGACAGAAGCATGCTCTGTGCCGGTC 

GCCCCCCCCCCGAATCTGCATACCGGAGACTTTCCAA 

GCAATTGAAGCAGGCTATACAGGACGCGGGGTAGAGGTAGGTATCGTCGAGACAGGCGAAT 
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TTTCCCGAACTGTATGGCAGAAAAGAACACGGCTATAACGAAAAOT 
GCGCCTGAAGACGGAGGCGGTAAAGAC^TTGAAGCTTCT 

ACGGATATCCGCCACGTAAAAGAAATCGGACACATCGATTTGGTCTCCCATACT 

AGACCTGCAGGCGGTATTGCGCCCGATGCGACGCTACACATAATGAAT^ 

GTTGCAGCCATCCGCAATGCATGGGTGAAGCTGGGCGAACGTC 

TCGAGGGCAGGCACTGCCGACCTTTTCCAAATAGCCAATO 

GGCGGTGATAAAACAGACGAGGGTATCCGCCTGATGCAACAGAGCG 

AAAAACATGCTTTTCATCTTTTCGACAGGCAATGACGCACA^ 

GAAAAAG&CGCTCAAAAAGGCATTATCACAGTCGCAC^ 

GGAGAACCGGGTACAGAACCGCTTGAGTATGGCTCCAACCATTGCGGAATTACTGCCA 
TATGAAGCAAGCGTCCGTTTCACCCGTACAAACCCGATTCAAATTGCCGGAAC^ 
GGCACGGCGGCTCTGCTGCTGCAGAAATACCCGTGGATGAGCAACGAC^ 
CAGGACATCGGTGCAGTCGGCGTGGACAGC^GTTCGGCTGGGGAC^ 

GCGTCCTTTCCGTTCGGCGACTTTACCGCCGATACGAAAGGTACATCCGATATTGCCTACTCCT^ 

TC^GGCACGGGCGGCCTGATCAAAAAAGGCGGCAGCCAACTGCAA 

ATTATCGAAGGCGGTTCGCTGGTGTTGTACGGCAACAA 

TATAACGGGGCGGCATCCGGCGGCAGCCTGAACAGCGACGGCATTGTCT 

AACGAAACCGTACACATCAAAGGCAGTCTGCAGCTGGACGGCAAAGGTACGCTGTAC^ 

AAAGTGGACGGTACGGCGATTATCGGCGGCAAGCTGTACATGTCG^^ 

ACCGGACGACGTCTTCCCTTCCTGAGTGCCGCCAAAATC 

GGCGGCCTGCTGGCTTCCCTCGACAGCGTCGAAAAAACAGCGGGCAGTGAAGGCGAGA.CGC 
CGCGGCAATGCGGCACGGACTGCTTCGGCAGCGGCACATTCCGCGCCCGCCGGTC 

GGCAGCAATCTGGAAAAC CTGATGGTCGAACTGGATGCCTCCGAATCATCCGCAACAC CCGAGACGGTTGAAACTGCG 

GCAGCCGACCGCACAGATATGCCGGGCATCCGCCCCTACGGCGCAACTTTCCGCGC^ 

AATGCCGCCGACGGTGTACGCATCTTCAACAGTCTCGCCGC 

ATGCAGGGACGCCGCCTGAAAGCCGTATCGGACGGGTTGGACC^^ 

CAACAGGACGGTGGAACGTGGGAACAGGGCGGTGTTGAAGGCAAAATGCGCGGCAG 

GCGAAAACCGGCGAAAATACGACAGCAGCCGCCACACTGGGCATGGGACGCAGCACATGGAGCG 

GCAAAAACCGAGAGCATTAGTCTGTTTGCAGGCATAC^ 

TCCTACX5GACGCTACAAAAACAGCATCAGCCGCAGCACCGGTGCGGACGAACATGCGGAA 
CTGATGCAGCTGGGCGCACTGGGCGGTGTCAACGTTCCGTTTGCCGCAACGGGAG^ 
CGCTACGACCTGCTCAAAGAGGATGCATTCGCCGAAAAAGGCAGTG^ 
GGCACGCTGGTCGGACTCGCGGGTCTGAAGCTGTCGCAACCCTTGA 

GTGGAACGCGACCTGAACGGACGCGACTACACGGTAACGGGCGGCTTTACCGGCGCGACTGCAGCAACCG 
GGGGCACGGAATATGCCGCACACCCGTCTGGTTGCCGGCCTGGGCGCGGATGTC 

TTGGCACGTTACAGCTACGCCGGTTCCAAACAGTACGGCAACCACAGCGGACGAGTCGGCGTAGGCTACCGG1T 
CTCGAG % 
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TAILATFCSG ALAATNDDDV 
DEDGTITKKD ATAADVEADD 
AESEIEKLTT KLADTDAALA 
IVKIDEKLEA VADTVDKHAE 
EETKQNVDAK VKAAETAAGK 
TNKDNIAKKA NSADVYTREE 
DHDTRLNGLD KTVSDLRKET 
DFNAGGTGIG SNSRATTAKS 
RDAKINAPPP NLHTGDFPNP 
GESVGSISFP ELYGRKEHGY 
EAVIETEAKP TDIRHVKEIG 
MNTNDETKNE MMVAAIRNAW 
NSEEQYRQAL LDYSGGDKTD 
NDAQAQPNTY ALLPFYEKDA 
EYGSNHCGIT AMWCLSAPYE 
LQKYPWMSND NLRTTLLTTA 
FGDFTADTKG TSDIAYSFRN 
EGGSLVLYGN NKSDMRVETK 
NETVHIKGSL QLDGKGTLYT 
NSTGRRVPFL SAAKIGQDYS 
YYVRRGNAAR TASAAAHSAP 
ETVETAAADR TDMPGIRPYG 
DSTAAHADMQ GRRLKAVSDG 
RGSTQTVGIA AKTGENTTAA 
DAGDIGYLKG LFSYGRYKNS 
PFAATGDLTV EGGLRYDLLK 
LSQPLSDKAV IjFATAGVERD 
RLVAGLGADV EFGNGWNGLA 



KKAATVAIAA AYNNGQEING 
FKGIiGLKKW tnltktvnen 

DTDAALDATT NALNKLGENI 
AFNDIADSLD ETNTKADEAV 
AEAAAGTANT AADKAEAVAA 
SDSKFVRIDG LNATTEKLDT 
RQGLAEQAAL SGLFQPYNVG 
AAVSYAGIKN EMCKDRSMLC 
NDAYKNLINL KPAIEAGYTG 
NENYKNYTAY MRKEAPEDGG 
HIDLVSHIIG GRSVDGRPAG 
VKLGERGVRI VNNSFGTTSR 
EGIRLMQQSD YGNLSYHIRN 
QKGIITVAGV DRSGEKFKRE 
ASVRFTRTNP IQIAGTSFSA 
QDIGAVGVDS KFGWGLLDAG 
DISGTGGLIK KGGSQLQLHG 
GALIYNGAAS GGSLNSDGIV 
RLGKLLKVDG TAIIGGKLYM 
FFTNIETDGG LLASLDSVEK 
AGLKHAVEQG GSNLENLMVE 
ATFRAAAAVQ HANAADGVRI 
LDHNGTGLRV IAQTQQDGGT 
ATLGMGRSTW SENSANAKTD 
ISRSTGADEH AEGSVNGTLM 
QDAFAEKGSA LGWSGNSLTE 
LNGRDYTVTG GFTGATAATG 
RYSYAGSKQY GNHSGRVGVG 
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It will be understood that the invention has been described by way of example only and 
modifications may be made whilst remaining within the scope and spirit of the invention. For 
instance, the use of proteins from other strains is envisaged [e.g. see WO00/66741 for 
polymorphic sequences for ORF4, ORF40, ORF46, 225, 235, 287, 519, 726, 919 and 953]. 

5 

EXPERIMENTAL DETAILS 

Cloning strategy and oligonucleotide design 

Genes coding for antigens of interest were amplified by PCR, using oligonucleotides 
designed on the basis of the genomic sequence of N. meningitidis B MC58. Genomic DNA 
10 from strain 2996 was always used as a template in PCR reactions, unless otherwise specified, 
and the amplified fragments were cloned in the expression vector pET21b+ (Novagen) to 
express the protein as C-terminal His-tagged product, or in pET-24b+(Novagen) to express 
the protein in Untagged* form (e.g. AG 287K). 

Where a protein was expressed without a fusion partner and with its own leader peptide (if 
15 present), amplification of the open reading frame (ATG to STOP codons) was performed. 

Where a protein was expressed in 'untagged' form, the leader peptide was omitted by 
designing the 5*-end amplification primer downstream from the predicted leader sequence. 

The melting temperature of the primers used in PCR depended on the number and type of 
hybridising nucleotides in the whole primer, and was determined using the formulae: 
20 T m i = 4 (G+C)+ 2 (A+T) (tail excluded) 

T^ = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for the 
whole oligo and 50-60°C for the hybridising region alone. 

Oligonucleotides were synthesised using a Perkin Elmer 394 DNA/RNA Synthesizer, eluted 
25 from the columns in 2.0ml NH4OH, and deprotected by 5 hours incubation at 56°C. The 
oligos were precipitated by addition of 0.3M Na-Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in water. 







Sequences 


Restriction 
site 


fti (961 )- 


Fwd 


CGCGGATCC -GGAGGGGGTGGTGTCG 


BamHI 
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741(MC58)-His 








Rev 


CCCGCrCQAG-TTGCTTGGCGGCAAGGC 


Xhol 


£u (961 )-983-His 


Fwd 


CGCGGATCC - GGCGGAGGCGGCACTT 


-Ufll 1 1 ILL 


Rev 


CCCGCECGAQ-GAACCGGTAGCCTACG 


Xhol 


tu (Vol)- Ort46.1- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 


Rev 


CCCGO^GAG-CGTATCATATTTCACGTGC 


Xhol 


fu (961 c-L). 
741(MC58) 


Fwd 


CGCGGATCC -GGAGGGGGTGGTGTCG 


BamHT 




Rev 


CCCGCrCQAG-TTATTGCTTnfirnrtrA ah 


Xhol 


fti (961c-L )-983 


X Wli 


CGCGGATCC - GGCGGAnncnnc ACTT 


iJamriL 


Rev 


CCCGCTCQAG-TCAGAACCGGTAGCCTAC 


Xhol ' 


fu (961c-L)- 
Orf4f» 1 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
tc a ci a TTTrkTr 1 a a a rr, A TTP 


BamHI 


Rev 


CCCGCTCGAG-TTACGTATCATATTTCAmTfiC 


Xhol 


fu-(AG287)-919. 
His 


rwd 


vAjCUIj A 1 1 UU 1 CjCj ILKjT- 
CAAAGCAAGAGCATCCAAACC 


BamHI 


Rev 


CCCAAGCTT-1TCGGGCGGTATTCGGGCTTC 


HindHI 


fu-(AG287)-953- 
His 


Fwd 


CGCGQATCCGGTGGTGGTGGT- 
GCCACCTACAAAGTGGAC 


BamHI 

XJ CU1U JJk 


Rev 


GCCCAAGCTT-TTGT1TGGCTGCCTCGAT 


HindITT 


fu-(AG287)-961- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT-ACAAGCGACGACG 


Dili ILtll 


Rev 


GCCCAAGCET-CCACTCGTAATTGACGCC 


HindHI . 


fu-(AG287> 

Orf4f» 1 -His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 


BamHI 


Rev 


CCCAAGCTT^GTATCATATTTCACGTGC 


HindHI 


fu-(AG287-919). 
kj rr**o» x -ins 


Fwd 


CCCAAGCTTGGTGGTGGTGGTGGT- 

TC A O A TTTHflP A A A Cfl A TTr 1 


HindHI 


Rev 


CCCGCTCGAG-CGTATCATATTTCACnTnr 


Xhol 


fu-(AG2«7- 
Orf46.1V919-His 


Fwd 


CCCAAGCTTGGTGGTGGTGGTGGT- 
CAAAGCAAGAGCATCCAAACC 


HindHI 


Rev 


CCCGCTCGAG-CGGGCGGTATTCGGGCTT 


Xhol ^ 


fu AG287(394.98)- 

■ 


Fwd 


CGCGGATCCQCTAGC-CCCGATGTTAAATCGGC 


Nhel 


Rev 


CGGGGATCC-ATCCTGCTCTTTTTTGCCGG 


BamHI 


fu Orfl-(Orf46.1)- 
His 


Fwd 


CGCGGATCCGCTAGC-GGACACACTTATTTCGGCATC 


Nhel 


Rev 


CGCGGATCC^CAGCGGTAGCCTAATTTGAT 




fu (Orfl)-Orf46.1- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 


Rev 


CCCAAGCTX-CGTATCATATTTCACGTGC 


Hindni 


fu(919)-Orf46.1- 
His 


Fwdl 


GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAG 


Sail 


Fwd2 


GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC 




Rev 


CCCGCTCGAG-CGTATCATATTTCACGTfir: 


Xhol 


Fu (orf46>287-His 


Fwd 


CGGGGATCCGGGGGCGGCGGTGGCG 


BamHI 


Rev 


CCCAASCniATCCTGCTCiTITiTGCCGGC 


ffindffl 


Fu (orf46)-919-His 


Fwd 


CGCGGATCCGGTGGTGGTGGTCAAAGCAAGAGCATCCA 
AACC 


BamHI 


Rev 


CCCAAGCTTCGGGCGGTATTCGGGCTTC 


Hindm 
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Fu(orf46-919)- 
287-His 


Fwd 


CCCCAAGCTTGGGGGCGGCGGTGGCG 


HindlH 


Rev 


CCCGCTCGAGATCCTGCTCrrrrriGCCGGC 


Xhol 


Fu (orf46-287)- 
919-His 


Fwd 


(XCMGCTTGGTGGTGGTGGTGGTCAAAGCAAGAGCAT 
CCAAACC 


Hindffl 


Rev 


CCCGCTCGAGCGGGCGGTATTCGGGCTT 


Xhol 


(AG741 )-961c-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGC£E£SAfi-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 


Rev 


CCCGCTCGAG-ACCCAGCrTGTAAGGTTG 


Xhol 


(AG741 )-961-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 


Rev 


CCCGCTCGAG-CCACTCGTAATTGACGCC 


Xhol 


(AG741 )-983-His 


Fwd 


OCGGCCTCGAG- 

GGATCCGGCGGAGGCGGCACTTCTGCG 


Xhol 


Rev 


CCCGCECQAS-GAACCGGTAGCCTACG 


Xhol 


(AG741 )-orf46.1- 
His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC 
GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA 


Sail 


Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 


(AG983)- 
741(MC58)-His 


Fwd 


GCGGCCICQACJ-GGATCCGGAGGGGGTGGTGTCGCC 


Xhol 


Rev 


CCCGCTCGAG-TTGCTTGGCGGCAAG 


Xhol 


(AG983)-961c-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 




Rev 


CCCGSCGAG-ACCCAGCTTGTAAGGTTG 


Xhol 


(AG983)-961-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 


Rev 


CCCGCTCGAG-CCACTCGTAATTGACGCC 


Xhol 


(AG983)- Orf46.1- 
His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC 
GCGGCGTC^ACGGTGGCGGAGGCACTGGATCCTCAGA 


Sail 


Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 



* This primer was used as a Reverse primer for all the C terminal fusions of 287 to the His-tag. 

5 Forward primers used in combination with the 287-His Reverse primer. 
NB - All PCR reactions use strain 2996 unless otherwise specified (e.g. strain MC58) 



In all constructs starting with an ATG not followed by a unique Nhel site, the ATG codon is 
5 part of the Ndel site used for cloning. The constructs made using Nhel as a cloning site at the 
5' end (e.g. all those containing 287 at the N-terminus) have two additional codons (GCT 
AGC) fused to the coding sequence of the antigen. 

Preparation of chromosomal DNA templates 

N.meningitidis strains 2996, MC58, 394.98, 1000 and BZ232 (and others) were grown to 
10 exponential phase in 100ml of GC medium, harvested by centrifiigation, and resuspended in 
5ml buffer (20% w/v sucrose, 50mM Tris-HCl, 50mM EDTA, pH8). After 10 minutes 
incubation on ice, the bacteria were lysed by adding 10ml of lysis solution (50mM NaCl, 1% 
Na-Sarkosyl, 50jig/ml Proteinase K), and the suspension incubated at 37°C for 2 hours. Two 
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phenol extractions (equilibrated to pH 8) and one CHCls/isoamylalcohol (24:1) extraction 
were performed. DNA was precipitated by addition of 0.3M sodium acetate and 2 volumes 
of ethanol, and collected by centrifugation. The pellet was washed once with 70%(v/v) 
ethanol and redissolved in 4.0ml TE buffer (lOmM Tris-HCl, ImM EDTA, pH 8.0)." The 
5 DNA concentration was measured by reading OD 2 6o. 

PCR Amplification 

The standard PCR protocol was as follows: 200ng of genomic DNA from 2996, MC58 1000, 
or BZ232 strains or lOng of plasmid DNA preparation of recombinant clones were used as 
template in the presence of 40|iM of each oligonucletide primer, 400-800 |iM dNTPs 
10 solution, Ix PCR buffer (including 1.5mM MgCl 2 ), 2.5 units TaqI DNA polymerase (using 
Perkin-Elmer AmpliTaQ, Boerhingher Mannheim Expand™ Long Template). 

After a preliminary 3 minute incubation of the whole mix at 95°C, each sample underwent a 
two-step amplification: the first 5 cycles were performed using the hybridisation temperature 
that excluded the restriction enzyme tail of the primer (T m i). This was followed by 30 cycles 
15 according to the hybridisation temperature calculated for the whole length oligos (T^). 
Elongation times, performed at 68°C or 72°C, varied according to the length of the Orf to>e 
amplified. In the case of Orf 1 the elongation time, starting from 3 minutes, was increased by 
15 seconds each cycle. The cycles were completed with a 10 minute extension step at 72°C. 

The amplified DNA was either loaded directly on a 1% agarose gel. The DNA fragment 
20 corresponding to the band of correct size was purified from the gel using the QiagenXJel 
Extraction Kit, following the manufacturer's protocol. 

Digestion of PCR fragments and of the cloning vectors 

The purified DNA corresponding to the amplified fragment was digested with the 
appropriate restriction enzymes for cloning into pET-21b+, pET22b+ or pET-24b+. Digested 
25 fragments were purified using the QIAquick PCR purification kit (following the 
manufacturer's instructions) and eluted with either H 2 0 or lOmM Tris, pH 8.5. Plasmid 
vectors were digested with the appropriate restriction enzymes, loaded onto a 1.0% agarose 
gel and the band corresponding to the digested vector purified using the Qiagen QIAquick 
Gel Extraction Kit. 



t 
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Cloning 

The fragments corresponding to each gene, previously digested and purified, were ligated 
into pET21b+, pET22b+ or pET-24b+. A molar ratio of 3:1 fragment/vector was used with 
T4 DNA ligase in the ligation buffer supplied by the manufacturer. 

5 Recombinant plasmid was transformed into competent E.coli DH5 or HB101 by incubating 
the ligase reaction solution and bacteria for 40 minutes on ice, then at 37°C for 3 minutes. 
This was followed by the addition of 800^1 LB broth and incubation at 37°C for 20 minutes. 
The cells were centrifuged at maximum speed in an Eppendorf microfuge, resuspended in 
approximately 200^1 of the supernatant and plated onto LB ampicillin (lOOmg/ml ) agar. 

10 Screening for recombinant clones was performed by growing randomly selected colonies 
overnight at 37°C in 4.0ml of LB broth + lOOng/ml ampicillin. Cells were pelleted and 
plasmid DNA extracted using the Qiagen QIAprep Spin Miniprep Kit, following the 
manufacturer's instructions. Approximately ljug of each individual miniprep was digested 
with the appropriate restriction enzymes and the digest loaded onto a 1-1.5% agarose gel 

15 (depending on the expected insert size), in parallel with the molecular weight marker (lkb 
DNA Ladder, GIBCO). Positive clones were selected on the basis of the size of insert. 

Expression 

After cloning each gene into the expression vector, recombinant plasmids were transformed 
into Ecoli strains suitable for expression of the recombinant protein. l|il of each construct 

20 was used to transform Exoli BL21-DE3 as described above. Single recombinant colonies 
were inoculated into 2ml LB+Amp (lOOjig/ml), incubated at 37°C overnight, then diluted 
1:30 in 20ml of LB+Amp (100|ig/ral) in 100ml flasks, to give an OD600 between 0.1 and 0.2. 
The flasks were incubated at 30°C or at 37°C in a gyratory water bath shaker until OD^oo 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD). Protein 

25 expression was induced by addition of l.OmM IPTG. After 3 hours incubation at 30°C or 
37°C the OD600 was measured and expression examined. 1.0ml of each sample was 
centrifuged in a microfuge, the pellet resuspended in PBS and analysed by SDS-PAGE and 
Coomassie Blue staining. 

Purification ofHis-tagged proteins 
30 Various forms of 287 were cloned from strains 2996 and MC58. They were constructed with 
a C-terminus His-tagged fusion and included a mature form (aa 18-427), constructs with 
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deletions (Al, A 2, A3 and A4) and clones composed of either B or C domains. For each 
clone purified as a His-fusion, a single colony was streaked and grown overnight at 37°C on 
a LB/Amp (100 ug/ml) agar plate. An isolated colony from this plate was inoculated into 
20ml of LB/Amp (100 ug/ml) liquid medium and grown overnight at 37°C with shaking 
The overnight culture was diluted 1:30 into 1.0 L LB/Amp (100 ug/ml) liquid medium and 
allowed to grow at the optimal temperature (30 or 37°C) until the OD 550 reached 0.6-0.8. 
Expression of recombinant protein was induced by addition of IPTG (final concentration 
l.OmM) and the culture incubated for a further 3 hours. Bacteria were harvested by 
centrifugation at 8000g for 15 min at 4°C. The bacterial pellet was resuspended in 7.5 ml of 
either (i) cold buffer A (300 mM NaCl, 50 mM phosphate buffer, 10 mM imidazole, pH 8.0) 
for soluble proteins or (ii) buffer B (lOmM Tris-HCl, 100 mM phosphate buffer, pH 8.8 and, 
optionally, 8M urea) for insoluble proteins. Proteins purified in a soluble form included 
287-ffis, Al, A2, A3 and A4287-His, A4287MC58-His, 287c-His and 287cMC58-His. 
Protein 287bMC58-His was insoluble and purified accordingly. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged 
at 13000xg for 30 min at 4°C. For insoluble proteins, pellets were resuspended in 2.0 ml 
buffer C (6 M guanidine hydrochloride, 100 mM phosphate buffer, 10 mM Tris- HC1, pH 7.5 
and treated with 10 passes of a Dounce homogenizer. The homogenate was centrifuged at 
13000g for 30 min and the supernatant retained. Supernatants for both soluble and insoluble 
preparations were mixed with 150ul Ni 2+ -resin (previously equilibrated with either buffer A 
or buffer B, as appropriate) and incubated at room temperature with gentle agitation for 30 
min. The resin was Chelating Sepharose Fast Flow (Pharmacia), prepared according to the 
manufacturer's protocol. The batch-wise preparation was centrifuged at 700g for 5 min at 
4°C and the supernatant discarded. The resin was washed twice (batch-wise) with 10ml 
buffer A or B for 10 min, resuspended in 1.0 ml buffer A or B and loaded onto a disposable 
column. The resin continued to be washed with either (i) buffer A at 4°C or (ii) buffer B at 
room temperature, until the OD280 of the flow-through reached 0.02-0.01. The resin was 
further washed with either (i) cold buffer C (300mM NaCl, 50mM phosphate buffer, 20mM 
imidazole, pH 8.0) or (ii) buffer D (lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3 and, 
optionally, 8M urea) until OD280 of the flow-through reached 0.02-0.01. The His-fusion 
protein was eluted by addition of 700ul of either (i) cold elution buffer A (300 mM NaCl, 
50mM phosphate buffer, 250 mM imidazole, pH 8.0) or (ii) elution buffer B (10 mM 
Tris-HCl, 100 mM phosphate buffer, pH 4.5 and, optionally, 8M urea) and fractions 



WO 01/64920 



-41- 



PCT/IB01/00420 



collected until the OD280 indicated all the recombinant protein was obtained. 20ul aliquots of 
each elution fraction were analysed by SDS-PAGE. Protein concentrations were estimated 
using the Bradford assay. 

Renaturation of denatured His-fusion proteins. 

Denaturation was required to solubilize 287bMC8, so a renaturation step was employed prior 
to immunisation. Glycerol was added to the denatured fractions obtained above to give a 
final concentration of 10% v/v. The proteins were diluted to 200 ug/ml using dialysis buffer 
I (10% v/v glycerol, 0.5M arginine, 50 mM phosphate buffer, 5.0 mM reduced glutathione, 
0.5 mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same buffer for 
12-14 hours at 4°C. Further dialysis was performed with buffer II (10% v/v glycerol, 0.5M 
arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM oxidised glutathione, 
pH 8.8) for 12-14 hours at 4°C. Protein concentration was estimated using the formula: 
Protein (mg/ml) = (1.55 x OD 28 o) - (0.76 x OD 260 ) 

Immunization 

Balb/C mice were immunized with antigens on days 0, 21 and 35 and sera analyzed at day 49. 
Sera analysis - ELISA 

The acapsulated MenB M7 and the capsulated strains were plated on chocolate agar plates 
and incubated overnight at 37°C with 5% C0 2 . Bacterial colonies were collected from the 
agar plates using a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD 6 2o- The bacteria were let to grow until the OD reached the value of 0.4-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and bacteria 
were washed twice with PBS, resuspended in PBS containing 0.025% formaldehyde, and 
incubated for 1 hour at 37°C and then overnight at 4°C with stirring. lOOul bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200ul of 
saturation buffer (2.7% polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200ul of 
diluted sera (Dilution buffer 1% BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to 
each well and the plates incubated for 2 hours at 37°C. Wells were washed three times with 
PBT. lOOul of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution 
buffer were added to each well and the plates were incubated for 90 minutes at 37°C. Wells 
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were washed three times with PBT buffer. 100^1 of substrate buffer for HRP (25ml of citrate 
buffer pH5, lOmg of O-phenildiamine and \0\x\ of H 2 0 2 ) were added to each well and the 
plates were left at room temperature for 20 minutes. 100^1 12.5% H 2 S0 4 was added to each 
well and OD 490 was followed. The ELBA titers were calculated abitrarely as the dilution of 
sera which gave an OD490 value of 0.4 above the level of preimmune sera. The ELIS A was 
considered positive when the dilution of sera with OD 490 of 0.4 was higher than 1:400. 

Sera analysis - FACS Scan bacteria binding assay 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C with 5% CO2. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth 
(Difco) containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by 
following OD620. The bacteria were let to grow until the OD reached the value of 0.35-0.5. 
The culture was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and 
the pellet was resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifuged 
for 5 minutes at 4000ipm. Cells were resuspended in blocking buffer to reach OD 6 20 of 0.05. 
lOOjil bacterial cells were added to each well of a Costar 96 well plate. 100^1 of diluted 
(1:100, 1:200, 1:400) sera (in blocking buffer) were added to each well and plates incubated 
for 2 hours at 4°C. Cells were centrifuged for 5 minutes at 4000rpm, the supernatant 
aspirated and cells washed by addition of 200|jl/well of blocking buffer in each well. 100(il 
of R-Phicoerytrin conjugated F(ab) 2 goat anti-mouse, diluted 1:100, was added to each well 
and plates incubated for 1 hour at 4°C. Cells were spun down by centrifiigation at 4000ipm 
for 5 minutes and washed by addition of 200nl/well of blocking buffer. The supernatant was 
aspirated and cells resuspended in 200|il/well of PBS, 0.25% formaldehyde. Samples were 
transferred to FACScan tubes and read. The condition for FACScan (Laser Power 15mW) 
setting were: FL2 on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. 
Gains 6.1; FI^2 PMT: 586; compensation values: 0. 

Sera analysis - bactericidal assay 

K meningitidis strain 2996 was grown overnight at 37°C on chocolate agar plates (starting 
from a frozen stock) with 5% C0 2 . Colonies were collected and used to inoculate 7ml 
Mueller-Hinton broth, containing 0.25% glucose to reach an OD620 of 0.05-0.08. The culture 
was incubated for approximately 1.5 hours at 37 degrees with shacking until the OD 6 2o 
reached the value of 0.23-0.24. Bacteria were diluted in 50mM Phosphate buffer pH 7.2 
containing lOmM MgCl 2 , 10mM CaCl 2 and 0.5% (w/v) BSA (assay buffer) at the working 
dilution of 10 5 CFU/ml. The total volume of the final reaction mixture was 50 ^il with 25 |il 
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of serial two fold dilution of test serum, 12.5 |il of bacteria at the working dilution, 12.5 |il of 
baby rabbit complement (final concentration 25% ). 

Controls included bacteria incubated with complement serum, immune sera incubated with 
bacteria and with complement inactivated by heating at 56°C for 30\ Immediately after the 
5 addition of the baby rabbit complement, 10|il of the controls were plated on Mueller-Hinton 
agar plates using the tilt method (time 0). The 96-wells plate was incubated for 1 hour at 
37°C with rotation. 7^1 of each sample were plated on Mueller-Hinton agar plates as spots, 
whereas lOpl of the controls were plated on Mueller-Hinton agar plates using the tilt method 
(time 1). Agar plates were incubated for 18 hours at 37 degrees and the colonies 
10 corresponding to time 0 and time 1 were counted. 

Sera analysis - western blots 

Purified proteins (500ng/lane), outer membrane vesicles (5|ig) and total cell extracts (25jjg) 
derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA 

15 at 4°C, using transfer buffer (0.3% Tris base, 1.44% glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton X100 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton X100 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1:200 in washing buffer. The membrane was washed twice and incubated for 90 

20 minutes with a 1:2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton XI 00 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

The OMVs were prepared as follows: K meningitidis strain 2996 was grown overnight at 37 
degrees with 5% CO2 on 5 GC plates, harvested with a loop and resuspended in 10 ml of 

25 20mM Tris-HCl pH 7.5, 2 mM EDTA. Heat inactivation was performed at 56°C for 45 
minutes and the bacteria disrupted by sonication for 5 minutes on ice (50% duty cycle, 50% 
output , Branson sonifier 3 mm microtip). Unbroken cells were removed by centrifiigation at 
5000g for 10 minutes, the supernatant containing the total cell envelope fraction recovered 
and further centrifuged overnight at 50000g at the temperature of 4°C . The pellet containing 

30 the membranes was resuspended in 2% sarkosyl, 20mM Tris-HCl pH 7.5, 2 mM EDTA and 
incubated at room temperature for 20 minutes to solubilise the inner membranes. The 
suspension was centrifuged at lOOOOg for 10 minutes to remove aggregates, the supernatant 
was further centrifuged at 50000g for 3 hours. The pellet, containing the outer membranes 
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was washed in PBS and resuspended in the same buffer. Protein concentration was measured 
by the D.C. Bio-Rad Protein assay (Modified Lowry method), using BS A as a standard. 

Total cell extracts were prepared as follows: N. meningitidis strain 2996 was grown 
overnight on a GC plate, harvested with a loop and resuspended in 1ml of 20mM Tris-HCl. 
Heat inactivation was performed at 56°C for 30 minutes. 
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CLAIMS 

1. A method for the simultaneous heterologous expression of two or more proteins of the 
invention, in which (a) two or more proteins of the invention are fused. 

2. The method of claim 23, in which the two or more proteins are: (a) 919 and 287; (b) 953 
5 and 287; (c) 287 and ORF46.1; (d) ORF1 and ORF46.1; (e) 919 and ORF46.1; (f) 

ORF46.1, 287 and 919; (g) 919 and 519; and (h) ORF97 and 225. 

3. The method of claim 24, in which 287 is at the C-terminal end of protein (a), (b) or (c). 

4. The method of any preceding claim, in which the expression is in an Exoli host. 

5. A protein expressed by the method of any preceding claim. 

10 6. A hybrid protein of formula NH2-A — B-COOH, wherein A and B are different Neisserial 
proteins. 

7. The protein of claim 6, wherein A and B are each selected from orfl, orf4, orf25, orf40, 
orf46, orf83, 233, 287, 2921, 564, 687, 741, 907, 919, 953, 961 and 983. 

8. The protein of claim 7, wherein A and B are each selected from ORF46.1, 287, 741, 919, 
15 953, 961 and 983. 

9. The protein of claim 8, wherein at least one of said ORF46.1, 287, 741, 919, 953, 961 
and 983 is used in essentially full-length form 

10. The protein of claim 8 or claim 9, wherein at least one of said ORF46.1, 287, 741, 919, 
953, 961 and 983 has a deletion. 

20 1 1. The protein of claim 10, wherein A and/or B has a poly-glycine deletion ('AG'). 

12. The protein of claim 1 1, wherein A and/or B is AG-287, AGTbp2, AG741, or AG983. 

13. The protein of claim 10, wherein A and/or B is a truncated protein, 

14. The protein of claim 13, wherein A and/or B is Al-287, A2-287, A3-287 or A4-287. 

15. The protein of claim 10, wherein a domain of A and/or B is deleted. 
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16. The protein of claim 15, wherein A and/or B is 287B, 287C, 287BC, ORF461-433, 
ORF46433-608, ORF46, or 961c. 

17. The protein of claim 6, wherein A and B are: (a) 919 and 287; (b) 953 and 287; (c) 287 
and ORF46.1; (d) ORF1 and ORF46.1; (e) 919 and ORF46.1; (f) ORF46.1 and 919; (g) 

5 919and519;or(h)ORF97and225. 

18. The protein of claim 17, wherein the protein is AG287-919, AG287-953, AG287-961, 
AG983-ORF46.1, AG983-741, AG983-961, AG983-961C, AG741-961, AG741-961C, 
AG741-983, AG741-ORF46.1, ORF46.1-741, ORF46.1-961, ORF46.1-961C, 961- 
ORF46.1, 961-741, 961-983, 961C-ORF46.1, 961C-741, 961C-983, 961CL-ORF46.1, 

10 961CL-741,or961CL-983. 

19. The protein of claim 8, wherein A or B is 287. 

20. The protein of claim 19, wherein B is 287 

21. The protein of claim 19, wherein A is AG-287 

22. The protein of claim 21 , wherein B is ORF46, 919, 953 or 96 1 . 

15 23. The protein of any one of claims 19 to 22, wherein 287 is from strain 2996 or 394/98f 

24. The protein of claim 8, wherein A is 961. 

25. The protein of any one of claims 6 to 24, wherein A and B are from the same strain. 

26. The protein of any one of claims 6 to 24, wherein A and B are joined directly 

27. The protein of any one of claims 6 to 24, wherein A and B are joined via a linker peptide. 

20 28. The protein of claim 27, wherein the linker peptide is a poly-glycine linker, with the 
proviso that B is not a AG protein. 



WO 01/64920 



PCT/IB01/00420 

» 



1/5 



FIGURE 1 — AG287—919 
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FIGURE 2 — AG287—9S3 
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FIGURE 3 — AG287—961 
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FIGURE 4 — AG287NZ—919 
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FIGURE 5 — AG287NZ—953 
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FIGURE 7 — AG983-ORF46.1 
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FIGURE 8 — AG983-741 




FIGURE 9 — AG983-961 
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FIGURE 10 — AG983-961C 
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FIGURE 11 —AG741-961 
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FIGURE 12 — AG741-961C 




FIGURE 13 — AG741-983 




FIGURE 14 — AG741-ORF46.1 
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FIGURE 15 — ORF46.1-741 




FIGURE 16 — ORF46.1-961 
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FIGURE 18 — 961 -ORF 46.1 




FIGURE 19 — 961-741 






FIGURE 22 — 961C-741 
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FIGURE 25 — 961cL-741 




FIGURE 26 — 961CL-983 
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